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LEARNING THEORY AND THE NEW “MENTAL 
CHEMISTRY”? 


W. K. ESTES 


Indiana University 


Hovering in the background of our 
scientific enterprises is a question which 
we, as investigators, can afford neither 
to raise very often nor to overlook 
entirely. Namely, do we have good 
reason to believe that the general meth- 
ods and working assumptions underly- 
ing our research can be counted on to 
lead in the long run to satisfactory 
interpretations of our phenomena? In 
the psychology of learning, there has 
been a division of labor, with the ex- 
perimentalists doing the overlooking, 
and each theorist having his turn at 
bringing up this question along with his 
answer. The answer is, of course, in 
each instance that the particular the- 
orist’s tactics point down the high road 
while those of the opposition lead into 
dark and forbidding culs-de-sac. 


1 This paper comprises, in substance, the 
writer’s Presidential Address to the Division 
of Experimental Psychology, American Psy- 
chological Assocation, 1959. Several indebt- 
ednesses deserve acknowledgment: an unre- 
stricted grant from the Ford Foundation 
supported portions of the experimental work ; 
periods of relative freedom from academic 
routine in stimulating surroundings were 
made possible by the Department of Psychol- 
ogy, Northwestern University and the Insti- 
tute for Mathematical Studies in Social Sci- 
ences, Stanford University during the spring 
and summer, respectively, of 1959 while the 
paper was in preparation. 


For my own part, although my hands 
are not entirely clean in the matter of 
theorizing, I had been content until 
very recently to go along with the 
single-minded “learning experimental- 
ist,” assuming that the Lord will look 
after those who remember their control 
groups and mind their Ps and Fs. The 
incident which jarred me out of this 
comfortable way of life began as a 
simple (in fact, as will be seen, un- 
usually simple) bit of experimentation 
with no philosophical overtones. The 
original purpose of the experimentation 
was to provide more cogent empirical 
support than had hitherto been avail- 
able for one of the central concepts of 
learning theory. 

Several recent reviewers (e.g., Deese, 
1958; Estes, 1956; Kendler, 1959; 
Restle, 1959) have noted that despite 
the popular stereotype of “learning the- 
ory” as virtually synonymous with 
“controversy,” there has steadily, al- 
though unobtrusively, accumulated a 
body of concepts and assumptions which 
command relatively wide agreement and 
which contribute motivation and direc- 
tion to a great part of the research 
being done in the field of learning. At 
the center of this core of communality 
one finds a concept which represents the 
distillation of centuries of theorizing 
about learning, not to speak of 70-odd 
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years of experimentation in the tradi- 
tion of functionalism and, later, be- 
haviorism: the concept of associative 
strength. In the “mental mechanics” 
and “mental chemistry” of pre-experi- 
mental associationism, this concept was 
verbalized in terms of the strength of 
associations between ideas ; with Thorn- 
dike, it became strength of stimulus- 
response bonds; with Hull, the basic 
quantitative constructs of habit strength 
and excitatory potential. And it is in 
terms of this concept that contemporary 
learning theorists express the basic dis- 
tinction between learning and perform- 
ance, as well as their fundamental pos- 
tulates relating the growth of habits to 
the number of training trials (“rein- 
forcements”’ ). 

There is little disagreement even in 
the precise quantitative form of these 
postulates, perhaps because our stand- 
ards of acceptability come directly from 
the observed forms of empirical curves 
relating probabilities of learned re- 
sponses, as well as measures of resist- 
ance to extinction or forgetting, to the 
number of training trials. Thus in a 
whole array of contemporary theories 
we find exactly the same equations ex- 
pressing the effect of a reinforced train- 
ing trial upon associative strength (cf. 
Restle, 1959). In Hull’s system, this 
equation is 


AH=k(M—-H) 


where AH represents the change in 
habit strength (H) on any reinforced 
trial, M is the maximum value of habit 


strength, and k is a constant. Spence’s 
(1955) revision of Hull’s theory is 
based on the same postulate, although 
there is some question as to whether the 
H in the equation should be replaced 
by E, representing excitatory potential 
(Spence, 1955, 1958). Similarly, in 
statistical learning theory (Estes, 1959) 
the corresponding assumption has been 
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expressed in the form 
Ap=6(1—)p) 


where A p represents the change in the 
proportion of stimulus elements con- 
nected to a given response, and @ is a 
constant (the proportion of stimulus 
elements sampled on the trial). In the 
stochastic model of Bush and Mosteller 
(1955), the same linear function is as- 
sumed, “strength” being simply iden- 
tified with response probability. For 
application to learning theorists, a well- 
known quotation may aptly be reversed 
to read, “In strength there is union.” 
And with respect to the experimental 
literature, it seems fair to say that the 
assumptions that associative strength 
increases with reinforcement, decreases 
during retention intervals, and general- 
izes to new stimuli are the cornerstones 
of most contemporary treatments of 
conditioning and elementary verbal 
learning. 

Areas of substantial agreement con- 
cerning either facts or interpretations, 
let alone both, are rare enough in the 
field of learning so that one might think 
we would do well to treasure the one 
we have located and carefully avoid 
doing anything to disturb it. This I 
(along with most of my fellow theorists 
and experimentalists) was happy to do, 
until my hand was forced by some 
purely experimental developments. The 
first of these was the, by now well- 
known, work of Kimble and his asso- 
ciates (1955, 1956; Dufort & Kimble, 
1958) who, working with an eyelid 
conditioning situation, tried the novel 
procedure of omitting the CS on a sub- 
stantial b'ock of trials during an acqui- 
sition series. The rather surprising re- 
sult of this variation was that the course 
of acquisition was virtually unaffected. 
Their interpretation was that only the 
first few reinforcements are actually ef- 
fective in modifying the strength of 
associations between CS and CR, the 
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rest of the “conditioning curve” actually 
reflecting some nonassociative process. 
The second development was contrib- 
uted by Rock (1957), who found no 
retardation in the speed of paired-asso- 
ciate learning when he introduced the 
device of replacing missed items with 
new ones at the end of every trial. 
Rock interpreted his findings as indicat- 
ing that associations form on an all-or- 
none basis and that it is only associa- 
tive strength in the sense of resistance 
to forgetting that grows as a function of 
number of reinforcements. 

It seemed curious that both of these 
deviations from the usual experimental 
paradigms had yielded results which 
are in some respects sharply at variance 
with the generally accepted conception 
of the acquisition process. Of course 
these findings might turn out to have 
special explanations that would leave 
the established conception undisturbed. 
3ut then again, they might not.. In the 
case of the Kimble, Mann, and Dufort 
(1955) study, counterexplanations and 
‘ounterexperiments have already begun 
to pour into the literature (see, e.g., 
Goodrich, & Wagner, 1957) 
with the result so far of complicating 
still further rather than of 
clearing anything up. One might argue 
that concepts and assumptions which 
have been supported by a large accum- 
ulation of experimental findings cannot 
be seriously threatened by one or two 
apparently negative results. But this 
argument is weakened by the observa- 


Ross, 


matters 


tion that all of the empirical support 


for the conceptualization of learning in 
terms of associative strength is quite 
indirect in character. 

The basic concepts and assumptions 
of learning theory are universally sup- 
posed to refer to states and processes of 
the individual organism. Yet the exist- 
ing evidence for the assumption that 
associative strength is an increasing 
function of number of reinforcements 
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comes from performance curves repre- 
senting average response measures over 
groups of learners, or from measures 
of resistance to extinction or retention 
scores averaged over groups of learners 
having different values of the independ- 
ent variable. Even the few bits of neg- 
ative evidence are indirect, depending 
on performance curves obtained under 
deviations from the usual experimen- 
tal paradigms but still representing 
changes in average scores over series 
of trials for groups of subjects (Ss). 
It would seem that if our basic con- 
ceptions are sound, it should be possible 
to cut through the web of group per- 
formance curves and obtain more direct 
and compelling evidence for the exist- 
ence of the assumed states and proc- 
esses in individual organisms. This, in 
any event, is what I set out to accom- 
plish for the concept of associative 
strength in a series of experiments now 
to be reported. 


ON THE DEFINITIONS OF “REINFORCE- 
MENT,” “TEST TRIAt,” AND 
“LEARNING” 


In standard human learning situa- 
tions, “learning” is almost -universally 
defined and measured in terms of a 
change in the probability, er frequency, 
with which a given stimulating situa- 
tion evokes a response (or instances of 
a response class) that has been desig- 
nated as “correct” by the experimenter. 
With one reservation, to be noted be- 
low, I shall follow this usage. But the 
situation is quite different with “rein- 
forcement,” the same term being used 
in at least two quite different senses by 
different investigators and thus promot- 
ing no end of confusion. My own 
habitual usage is the “neutral defini- 
tion” (Hilgard, 1956, p. 409) which 
identifies reinforcement er dirically with 
the operation that is supplied by the 
experimenter in order to produce learn- 
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ing, as defined above, in any given sit- 
uation. In a paired-associate situation, 
the reinforcing operation is the paired 
presentation of the stimulus and re- 
sponse members of an item; in classical 
conditioning it is the paired presenta- 
tion of CS and US;; in verbal condition- 
ing, the reinforcing operation for a 
given predictive response (e.g., predict- 
ing that the left light will appear) is the 
occurrence of the corresponding event 
(appearance of the left light)—in each 
case without regard to whether the S 
correctly anticipated the response mem- 
ber of the paired-associate item, gave a 
CR prior to occurrence of the US, or 
correctly predicted the event on the trial 
in question. The only property that 


different types of reinforcing operations 
are assumed to share is their common 
quantitative effect. on the conditional 
probabilities of the possible alternative 
responses to the stimulating situation 
in which reinforcement occurs. 

A narrower definition, favored espe- 


cially by writers associated with a 
drive-reduction interpretation of rein- 
forcement, would limit the term rein- 
forcement to an operation that follows 
and is contingent upon the occurrence 
of the reinforced response on any trial. 
In this usage, reinforcement in paired- 
associate learning occurs only when the 
S has made a correct response in antic- 
ipation of the paired stimulus-response 
presentation, and reinforcement in ver- 
bal conditioning occurs only on trials 
when the S correctly predicts the tria 
outcome. Whether, according to this 
view, reinforcement occurs on only those 
trials of a classical conditioning experi- 
ment on which a CR occurs prior to the 
US depends upon theoretical decisions 
as to whether the CR and UR are “the 
same response” and whether reinforce- 
ment occurs at the onset or the termina- 
tion of the US. 

The primary advantage I see in the 
“neutral definition” is that it can be 
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applied in an objective and consistent 
manner independently of one’s position 
on systematic or theoretical 
Learning certainly may occur prior to 
the first correct anticipation in a paired- 
associate experiment, prior to the first 
correct prediction in verbal condition- 
ing, prior to the first CR in classical 
conditioning. The present usage per- 
mits us to speak, for example, about 
changes in probability of a response as 
a function of reinforcements on trials 
preceding its first occurrence, on the 
one hand, and changes as a function of 
reinforcements on trials including and 
following its first occurrence, on the 
other, without changing our definition 
of reinforcement. 

It should be emphasized that the 
neutral definition does not beg such 
questions as whether presentation of a 
US following a CR constitutes the same 
reinforcing operation as presentation of 
a US on a trial when the CR did not 
occur; these two procedures represent 
instances of the same reinforcing opera- 
tion if and only if they produce the 
same change in the probability of evoca- 
tion of the CR by the CS. However, 
it seems strategic to avoid issues of this 
sort when, as in the present investiga- 
tion, we are concerned with the nature 
of the changes in response tendencies 
during learning rather than with the 
conditions giving rise to these changes. 
Consequently, in the experiments to be 
reported, we have attempted so far as 
possible to avoid the customary con- 
founding of reinforcement with ante- 
cedent response. In paired-associate 
situations, for example, we have de- 
viated from the usual anticipation pro- 
cedure by separating the reinforcement 
(paired-presentation of stimulus and 
response members of an item) from the 
test for learning (presentation of the 
stimulus member alone) so that an item 
may receive more than one reinforce- 
ment before the first test trial or may 


issues, 
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receive repeated test trials without in- 
tervening reinforcement. 

For purposes of measuring retention, 
it would be ideal if one could give test 
trials on which no learning at all oc- 
curred. 
is conceived solely in terms of the def- 
inition given above (increase in prob- 
ability of the “correct’”’ response to a 
given stimulus), this goal is not too 
difficult to approximate. It seems in- 
tuitively clear, and can be demonstrated 
empirically (Estes, Hopkins, & Croth- 
ers, 1960), that no systematic increase 


: ‘” 
Indeed, so long as “learning 


in probability of correct responses to, 
say, paired-associate items will occur 
series of trials on the 
stimulus members are presented alone 


over a which 
and the S’s responses receive no re- 
ward or informational feedback 
the experimenter. We cannot, how- 
ever, rule out the possibility that on 
these trials there might be learning in 
the sense of an increase in probability 


from 


of whatever responses, correct or incor- 
rect, actually occur. In fact, there is 
evidence that such learning does occur, 
but at a relatively low rate compared 
to the learning that occurs on rein- 
forced trials (Estes et al., 1960). Con- 
sequently, in the analyses to follow, | 
shall that trials 
can be treated, without serious error, 
simply as test trials 
primary interest is in measuring the 
effects of preceding reinforced trials. 


assume unreinforced 


“neutral” when 


UNITARY ASSOCIATION VS. 
Hasit HIERARCHY 


If the stimulus-response relation es- 
tablished by reinforcement were a uni- 
tary, all-or-none 
sort, then the learning of a new re- 


connection of some 


sponse to a stimulus would automat- 
ically displace a previously associated re- 


sponse. Contrariwise, in every variant 


of the concept of associative strength, it 


is assumed that a number of different 


responses may simultaneously be asso- 
ciated with the same stimulus, the rela- 
tive strengths of association depending 
primarily upon preceding frequencies 
of reinforcement of the different re- 
sponses in the presence of the stimulus. 
This latter assumption has been em- 
bodied in Hull’s “habit family hier- 
archy.” 

To spell out the question at issue in 
experimental terms, let us suppose that 
first Response A and subsequently Re- 
sponse B have been reinforced in the 
presence of a given stimulus (or stim- 
ulus complex—the nature of the stim- 
ulation does not matter so long as it is 
the same on each trial) for each mem- 
ber of a group of Ss and that a test trial 
now reveals an observed probability 
(relative frequency) of .5 for each re- 
sponse. Does this mean that for each 
S both Response A and Response B 
now have approximately equal habit 
strengths and therefore equal probabil- 
ities of evocation by the given stimulus? 
Such is the interpretation required by 
the conception of associative strength 
(habit strength, excitatory potential, 
proportion of conditioned elements, or 
whatever) as well as by any of the con- 
temporary models formulated in terms 
of continuously variable response prob- 
abilities for individual Ss. Stated in 
these concrete terms, however, it does 
not seem that the assumption need de- 
pend for support only on extremely 
indirect evidence, as has hitherto been 
the case. To test these implications of 
the strength concept quite directly, we 
apparently need only reinforce two dif- 
ferent responses to the same stimulus 
for an individual S, and then, by means 
of a series of unreinforced test trials, 
give the two responses an opportunity 
to exhibit their relative strengths—if 
such exist. The predictions under test 
would be straightforward. On the hy- 
pothesis of associative strength, or habit 
family hierarchy, we expect individual 
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Ss to shift back and forth between the 
previously reinforced responses 
over a series of tests. On the hypothe- 
sis of a unitary, all-or-none association 
between stimulus and response, we ex- 
pect that an individual S who makes 
Response A, say, on the first test trial 
with a given stimulus will not shift to 
B on subsequent tests. 

A paired-associate learning situation 
offered a number of convenient features 
for an empirical realization of the hypo- 
thetical experiment just described. The 
stimuli and reinforcements are readily 
controllable ; and, by embedding a stim- 
ulus to which two different responses 
are to be reinforced in a conventional 
list, one can make S’s task appear little 
different from familiar learning situa- 
tions. For each of 20 Ss, the proce- 
dures were replicated with six eight- 
item lists, all made up on the same 
principles.? The stimuli were all 
sonant syllables and the responses one- 
Half of the 


] 
i 


two 


con- 


syllable words. items in 


each list had single correct 


g 
examples are: 


responses ; 


RESPONSE 
wish 
sped 


STIMULUS 
HTX 
JFR 


(hese were simply “ballast” and will not 
be considered further. The other half 
of the items had two correct responses 
for each stimulus ; examples are: 


STIMULUS RESPONSES 
DGR 


BCG 


thaw, weep 


pink, rule 


The two correct responses to each stim- 
ulus were reinforced equally often for 
each §. By “reinforcement,” in this 
context, we mean simply a paired pres- 
entation of stimulus and correct re- 
sponse to S; by “test,” a presentation 
of the stimulus member of an item 

2 This experiment was conducted at In- 
diana University with the assistance of E. J 
Crothers. 
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alone. In half of the replications, there 
were exactly two reinforcements pre- 
ceding the test trials, one on each re- 
sponse; in the remaining replications, 
there were four reinforcements, two on 
each response, prior to the test trials. 
Following the reinforcements, a series 
of unreinforced test trials was given, 
the order of the stimuli being random- 
ized anew for each test. 

The critical data for our purposes are 
the proportions of cases in which the 
response given on the first test trial to 
a given stimulus was repeated on the 
According to an interpre- 
tation in terms of associative strength, 
or habit family hierarchy, repetitions 
(AA or BB) and shifts (AB or BA) 
should occurred with roughly 
equal frequencies.* According to the 
notion of a unitary association, AA and 
BB should have each occurred about 
half of the time and the shifts AB and 
BA The results were as 
follows : Following two reinforcements, 


second test. 


have 


" 
all 


not at 


there were 85% repetitions and 15% 
shifts on the first two test trials, the 
percentages being based on all items 
that had correct responses on both tests. 


reinforcements, these 
values were for repetitions and 
only 11% for shifts. On later pairs of 
tests, the frequencies of AB and BA 
shifts were even smaller. Thus the 
results do not offer very impressive 
support for the assumption that a habit 
hierarchy exists in the individual S fol- 
lowing the reinforcement of two differ- 
ent responses to the same stimulus. On 
the other hand, the data appear quite 
harmonious with an assumption of uni- 
tary association, since the proportions 
of AB and BA shifts appear small 


four 
89% 


Following 


8 The response given on the first test trial 
was the more recently reinforced response in 
approximately 60% of cases. Thus more in- 
stances of AA than BB would be expected; 
nevertheless the proportions of repetitions and 
shifts expected on the associative strength 
hypothesis are nearly equal (.52:.48). 
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enough to be attributable to minor un- 
controlled factors such, for example, as 
fluctuations in context (background 
stimulation, stimulus traces from pre- 
ceding items) from one trial to the 
next.* 
Although experiment 


the present 


failed to produce the anticipated direct 
support for the concept of associative 


strength, a “strength theorist,” partic- 
ularly one working within the frame- 
work of Hull’s system, may not be too 
disturbed. With the benefit of an ad 
hoc assumption concerning the range of 
“behavioral oscillation,” the model of 
Hull and Spence can be made to yield 
the prediction that, of two responses 
reinforced to the same stimulus, one or 
the other will dominate on test trials, 
thereby preventing shifts from one to 
the other on the part of individual Ss. 
In order to obtain a more decisive test 
of the strength conception, we evidently 
require an even simpler experiment in 
which only one response is ever rein- 
forced. 
With the 


indefinite regress through a series of 


forestalling an 


hope of 


progressively more refined experiments, 


4It should be noted that these results cast 
doubt on the concept of habit hierarchy only 
as applied to response probabilities in the 
presence of a stimulus situation that has been 
unit over the series of 
reinforced and test trials. M. S. Schoeffler 
(personal communication) and, in another 
unpublished study, Crothers and I have ob- 
tained positive evidence for the existence of a 


manipulated as a 


sociated with a compound 


been 


habit hierarchy as 
stimulus, components of 
separately with 
during the training trials. 

compounding provide a control for the pres- 
high proportions 


which have 
reinforcements 
The results with 


correlated 


ent experiment.- If the very 
of repetitions reported above were attrib- 
utable to learning that occurred on the first 
test trial, or to some nonassociative variable, 
then a similar excess of repetitions should 
have occurred over with 
stimulus compounds; however, no such ex- 
cess was observed either in Schoeffler’s study 


successive tests 


of compounding or in ours 
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suppose we ask what is the minimum 
set of operations and observations ac- 
tually needed in order to demonstrate 
learning. Normally there must be a 
pretest in order to determine the initial 
probability of the to-be-learned behav- 
ior inthe test situation; in practice the 
experimenter often has a priori infor- 
mation about initial response probabil- 
ities which makes the pretest dispens- 
able. There must be a presentation of 
some reinforcing operation, and after- 
ward a test to assess the change in 
performance produced by the reinforce- 
ment. If the function of response oc- 
currences and nonoccurrences is to be 
determined, there will have to be a 
second test trial. And there we have it. 
Controlled comparisons relative to ef- 
fects of the principal events occurring 
during an acquisition series can, in 
principle, be accomplished in an experi- 
ment running to about a trial and a half. 
3y usual standards, this constitutes 
what can only be called a “miniature 
experiment.” However, miniature ex- 
periments appeared to be what the tac- 
tical situation called for, and therefore 
miniature experiments are what we set 
out to run. 


CONCEPTIONS OF THE ACQUISITION 
Process: ASSOCIATIVE STRENGTH 
vs. ALL-Or-NoONE MopDELs 


In the first of these experiments,’ we 
used a paired-associate situation with 
consonant syllables as stimuli and num- 
bers as responses. Forty-eight Ss were 
run with an eight-item list, yielding 384 
observations on the first test trial. The 
principal portion of the experiment con- 
sisted simply in presenting each S once 
with each stimulus-response pair and 
then testing with each stimulus alone 
(in a new random order). Before pro- 

5 This experiment was conducted at In- 
diana University with the assistance of B. L. 
Hopkins; for a full report of the method and 
results see Estes et al. (1960). 
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Reinforcement 


Fic. 1. Schema representing effects of a 
single reinforcement according to incremen- 
tal (upper branch) vs. all-or-none (lower 
branch) theories. Squares represent Ss, 
with the proportion of darkened area in each 
indicating the probability of the correct re- 
sponse (C) for the given individual 


ceeding to the results, let us examine 
the outcome expected on the basis of 
the notion of learning as a change in 
associative strength. In Figure 1 the 
situation is schematized in terms of a 
single item. The four squares at the 
left represent four hypothetical Ss, the 
emptiness of the squares indicating that 
all start the experiment with zero prob- 
abilities of making the correct response. 
Now we give a single reinforcement 
(paired presentation of the stimulus 
and correct response), the result of 
which is to raise the probability of the 
correct response (C) to, say, .25. The 
upper arrow leads to the theoretical 
state of affairs after this reinforcement, 
according to an interpretation based on 
the conception of associative strength. 
The strength of the association is in- 
creased for all of the Ss; and, neglect- 
ing for the moment possible individual 
differences, the probability of the cor- 
rect response is now .25 for each indi- 
vidual, the one at the upper right who 
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happened to make a correct response 
on the test, and the three who did not. 

Suppose now that the interpretation 
based on the concept of strength were 
completely wrong and that stimulus- 
response associations really formed on 
an all-or-none basis. Then the state of 
affairs after the reinforcement should 
be as shown in the lower part of the 
figure. Again the probability of a cor- 
rect response increases from zero to .25, 
but the .25 now refers only to the 
group, not to any individual S. One § 
has formed the association (darkened 
square ), and three have been unaffected 
by the reinforcement (empty squares). 

To distinguish empirically between 
these two logically possible outcomes, 
we need only add the remaining half- 
trial to our trial-and-a-half, i.e., give 
another test without intervening rein- 
forcement. Now, if the upper branch 


of the diagram is essentially correct, all 
Ss should have equal probabilities of 


making the correct response on the 
second test trial, regardless of what 
they did on the first test. But if the 
lower branch is correct, correct re- 
sponses on the second test should come 
only from Ss who made correct re- 
sponses on the first test. None should 
come from Ss who made incorrect re- 
sponses (N) on the first test, for these 
Ss would not have profited at all from 
the learning trial. 

There might be some attenuation of 
the expected proportions of correct re- 
sponses on Test 2 by Ss making correct 
responses on Test 1 if there is any for- 
getting in this situation, but the pro- 
portions of correct following incorrect 
provide a critical comparison. If the 
all-or-none view is correct, then this 
proportion should be zero, or at least 
no greater than could be achieved by 
sheer guessing. But if any version of 
the strength conception is correct, then 
the proportion of correct following in- 
correct responses should be greater than 
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chance. In order to make the outcomes 
that can be tolerated by the two inter- 
pretations sharply different, we need 
only choose our experimental materials 
and conditions so that the overall pro- 
portions correct on both first and sec- 
ond tests are well above chance. It can 
be seen in Figure 2 that this has been 
achieved, for approximately 50% of the 
items were correct on the first test and 
nearly 40% on the second. Consider- 
ing the critical branch of the 
diagram, leading from an incorrect re- 
sponse on the first test to a correct re- 
sponse on the second, we see that the 
results lean strongly in the direction 
prescribed by an all-or-none conception, 
for the 9% of correct following incor- 
rect responses is less than the 1214% 
that could be achieved even by rather 
unintelligent guessing with an eight- 
item list if the reinforcement had no 
effect at all on these items. The differ- 
ence between this value and the 71% 
of correct following correct responses is 
so large that a statistical test would be 
an empty formality. 

A possible defense that might be ad- 
vanced by a “strength theorist” is the 
hypothesis that the 51% of cases with 
incorrect the first test 
simply represent preponderantly slower 
learners or more difficult items than the 
49% responses. 
If so, then a control condition in which 


lower 


response Ss on 


of cases with correct 


a second reinforcement 


tween the first and second tests should 


is given be 


yield a percentage of correct responses 
on Test 2 following incorrect on Test 1 
that is much smaller than the percen- 
tage correct on Test 1. This control 
was run (with the same 48 Ss but dif- 
ferent and the result is 
in Figure 3. The effect of the first re 
inforcement on the full set of Ss and 
items was to raise the probability of a 
correct response from near zero to .40; 
the effect of the second reinforcement 
on cases having incorrect responses on 


items), shown 
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the first test was to raise the probability 
of a correct response from near zero to 
.46. Thus there seems to be no support 
forthcoming for the hypothesis of a 
large difference in learning rate be- 
tween cases which did and cases which 
did not have correct responses on the 
first test. 

Although it would be nice to claim 
credit for rare prescience in predicting 
the outcome of this little experiment, 
the fact is that the result came as a 
distinct jar to my preconceptions. In 
designing the study, our idea was not 
to undermine the strongly entrenched 
concept of associative strength, but to 
support it by showing that the results 
of Rock’s experiments, apparently call- 
ing for an all-or-none interpretation, 
must be attributed to some artifact con- 
cealed in his ingenious but somewhat 
complex procedures. Thus when Hop- 
kins and I examined the data from our 
initial group of 24 Ss and found the 
pattern shown in Figures 2 and 3, our 
first reaction was to replicate the whole 
thing with another group. But when 
the two replications turned out to agree 
in every essential respect, we were left 
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lic. 2. Results of miniature experiment 
on acquisition of paired-associates. Empiri- 
cal values are proportions of instance in 
which correct (C) and incorrect (N) re- 
sponses on first test trial after a single 
reinforcement (paired presentation of stim- 
ulus and response members) were followed 
by C and N responses on a second test trial 
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Fic. 3. Proportion correct on test after 
a second reinforcement for cases not having 
correct responses on first test compared with 
proportion correct on first test for the full 
set of Ss and items. The Ss and situation 
are the same as those represented in Figure 2. 


with no obvious course but to begin 
digesting an unanticipated and not en- 
tirely palatable conclusion. The most 
cleanly controlled comparisons we had 
managed to devise yielded no evidence 
that repeated reinforcements in this sit- 
uation have any function other than to 
give repeated opportunities for the dis- 
continuous formation of a learned asso- 
ciation between stimuli and 
responses. 

Still, it is well known that theoretical 
doctrines do not yield readily to nega- 
tive evidence. One whose theories are 
based on a concept of strength will lose 
little ground if he can make a stand on 
the claim that all-or-none acquisition is 
simply a peculiarity of the paired-asso- 
ciate experiment and not characteristic 
of human learning in general. To eval- 
uate this possible defense of the strength 
concept, we clearly shall have to turn 
to some different situation that is quite 
different in the response mechanism 
and reinforcing operations from paired- 
associate learning. Eyelid conditioning 
meets these specifications, and it is 
convenient for our purposes since a 
colleague, I. Gormezano, has kindly 
made available his data: from an in- 


observed 


tensive period of data collecting in 
the Wisconsin conditioning laboratory. 
Gormezano trained a sufficiently large 
group, approximately 170 Ss, under 
identical conditions so that the first few 
acquisition trials can be treated as one 
of our miniature experiments and ana- 
lyzed in much the same way as the 
paired-associate study. 

The situation obtaining over the first 
couple of trials is schematized in Figure 
4. In the diagram, T, is the first CS 
presentation, prior to the first reinforce- 
ment, and we shall consider only Ss 
who made no CR on this test. Thus 
the initial probability of a CR is taken 
to be zero. Suppose now that the 
effect of the first reinforcement is to 
raise the probability of a CR to .25. 
According to the strength conception, 
shown in the upper panel, each S has 
his strength of conditioning increased 
by the same amount by this reinforce- 
ment and now has probability .25 of 
making a CR. Then the second re- 
inforcement increases the conditioned 
strength for each S again ; and, regard- 
less of whether or not a particular S 
happened to make a CR on T,, he now 
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Fic. 4. Schema for first two trials of eye- 
lid conditioning experiment showing changes 
in CR probability (proportion of darkened 
area in squares representing Ss) prescribed 
by incremental vs. all-or-none theories. 
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has a higher probability (%@ if we 
apply the linear function mentioned 
earlier). According to an all-or-none 
conception, the situation after the first 
reinforcement, shown in the lower 
panel, is that for 14 of the Ss the CR 
has become associated with the CS and 
for the remaining 34 of the Ss no con- 
ditioning has occurred. The effect of 
the second reinforcement is to give the 
unconditioned Ss another chance, and 
14, of these now become conditioned. 
The differential prediction, then, con- 
cerns the probability of a CR on the 
third test for Ss who made ne CR on 
the second test (and similarly the prob- 
ability of a CR on the fourth test for Ss 
who made none on any test, 
and so The strength aT 
requires this conditional probability to 
all-or-none con- 


pre Vv ious 


on ). 


increase, whereas the 
ception it to remain constant. 
The test seems quite sharp, for even with 
allowance for variation in conditioning 
rates among Ss, a model which assumes 
that associative strength increases with 
reinforcements cannot stand constancy 
assump- 


requires 


unless its 
that it reduces to 


of this pro ybability 
so restricted t 
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tions are 
an all-or-none moc 
The pertinent results of Gormezano’s 
study are Figure 5, carried 
through the first four trials, beyond 
which the number of cases begins to 
drop off too much for comfort. In- 
the sequence of probabilities 

of after 1, 2, or 3 consecutive 
NCRs 15, .14—we find the hy- 
pothesis of constancy appearing rather 
than 


shown 


more attractive the progressively 
® For this example, the parameter @ in the 
function Ap=6(1—>p) is equal to % and 
after the first experiment p is also equal to 
44. Therefore we have 
ap=%(1— %) = Meo 
after the second 


and for the probability 


reinforcement, 


p+ Ap=Y%t Ke —hKe 
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Fie. 5. hai acquisition data from 
Gormezano’s study of eyelid conditioning. 
Values of particular interest are the propor- 
tions of CRs following 1, 2, or 3 consecutive 
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increasing trend required by the strength 
interpretation. (According to a linear 
model, for example, the value .15 for a 
CR after an NCR should have been .24, 
and the .14 for a CR after two NCRs 
should have been .34.) 

The consistency of these conditioning 
data with those of the paired-associate 
situation is almost too good to be true. 
In psychology we are not used to hav- 
ing quantitative tests of alternative the- 
oretical notions yield such apparently 
decisive outcomes. Consequently, and 
considering the importance of the the- 
oretical issue, perhaps we will not yet 
be accused of beating a dead hypothesis 
if we look for one more test with ex- 
perimental arrangements differing from 
both of those preceding. We would 
like a situation similar to paired-asso- 
ciate learning in that unreinforced test 
trials can readily be given without dis- 
turbing the learners but one which elim- 
inates the possibility of achieving sub- 
stantial proportions of correct responses 
by guessing. A situation which meets 
these desiderata is the free verbal recall 
experiment used by Bruner, Miller, and 
Zimmerman (1955). For our present 
purposes the minimal experiment will 
consist of a single reinforcement fol- 
lowed by two recall tests. The rein- 
forcement involves merely the experi- 
menter’s reading a list of words aloud 
to S. Ona recall test, S is asked to 
write down as many words as he can 





W. K. 





—L 25. 


.005 
39 
54 ¢ 
46 
02 

98 yy 


Te 
T5 ¢ 
N 


Cc 





CASES 





v2 N 


Fic. 6. Results of two miniature experi- 
ments on free verbal recall, showing near- 
zero proportions of correct responses on a 
second test trial for cases which did not have 
correct responses on the first test after a 
single reinforcement. 


remember (in any order). Then after 
an interval during which no additional 
reinforcement is given, S is (unex- 
pectedly) tested again. 

Results of two experiments * 
sort are shown in Figure 6. The upper 
tree represents an experiment with 35 
Ss, each given a list of eight words at 
R,. On the first test, T,, 61% of the 
280 opportunities for correct responses 
(C) yielded either incorrect responses 
or omissions (N), and of these less 
than 1% were followed by correct re- 
sponses on the second test, T,. Ina 
replication conducted with some minor 
variations in procedure, 102 Ss were 
presented with eight words each on the 
reinforced trial. This time (lower tree 
in Figure 6) there were 72% N re- 
sponses on the first test and less than 
2% of these were followed by C re- 
sponses on the second test. Clearly, if 
a word is not given correctly on the 
first test by a particular S, the chances 
are virtually nil that it will be correct 
on a second test. 


of this 


7 These experiments, conducted at Indiana 
University with the assistance of Judith 
Crooks, will be reported in detail elsewhere. 
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This result does not, of course, 
prove that reinforcement has exerted no 
strengthening effect on the associations 
in the cases when the correct response 
failed to occur on the first test. But 
one whose theory requires him to as- 
sume that such strengthenings occur 
has a taxing assignment in producing a 
case for the existence of factors or proc- 
esses which appear in just sufficient 
force to cancel out the hypothesized 
increments in response strength under 
each set-of experimental procedures we 
have examined. Explanations depend- 
ing on such factors as individual differ- 
ences are not very prepossessing in the 
light of control comparisons of the type 
exhibited in Figure 3 (a similar con- 
trol, with a similar result, was used for 
the free-verbal-recall situation). One 
might appeal to the effects of learning 
which occurs on the first test trial itself, 
arguing that an incorrect response which 
occurs on the first test receives a large 
increment in associative strength (from 
sheer contiguity or perhaps from some 
unspecified source of reinforcement ) 
and therefore recurs with high prob- 
ability on the next test. One important 
difficulty with this hypothesis is that the 
data do not support it. In the paired- 
associate study cited above, for ex- 
ample, the observed relative frequency 
with which an incorrect response oc- 
curring on the first test was repeated 
on the second test was only .24. Inter- 
pretations which preserve the incremen- 
tal conception of associative learning 
should certainly be sought with all 
vigor ; at the time of writing, however, 
none has come to my attention that 
seems at all plausible. 


RETENTION AS A FUNCTION OF REIN- 
FORCEMENT AND INTERSPERSED 
Test TRIALS 

The story does not end here. Even 
if one is ready to grant that associations 
are made and broken in an all-or-none 
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fashion, this concession, although by no 
means a minor one, does not exhaust 
the resources of the strength concept. 
It is possible that after an association 
has once formed, associative strength in 
the sense of resistance to forgetting con- 
tinues to grow as a function of rein- 
forcements. In fact the experimental 
situations we have used appear well 
suited to demonstrate such an effect. 

It will not have escaped notice that 
although the three miniature experi- 
ments yielded sharply negative results 
relative to the conception of learning as 
a gradual strengthening of associations 
by reinforcement, they agree only in 
part with the usual notion of all-or- 
none learning (as, for example, defined 
by Voeks, 1955). The formation of 
learned associations appears to be dis- 
continuous rather than gradual, but 
once a correct response (or conditioned 
response ) has occurred to a given stim- 
ulus, it does not appear with certainty 
when the stimulus recurs after a rest 
interval. In fact, “forgetting” as meas- 
ured by the proportion of shifts from a 
correct response on the first test to non- 
correct on the second test ranges from 
about 20% in the paired-associate sit- 
uation to about 50% in free verbal 
recall. For the eyelid conditioning 
study, we did not report these propor- 
tions since the numbers of observations 
on which they were based were rather 
small; but for what it is worth, we 
might add that the proportions of CR 
to NCR shifts following the first three 
tests, despite the intervening reinforce- 
ment, fall well within the range given 
above for the other two experiments. 

The minimum set of operations and 
observations required to test for a de- 
pendence of “resistance to forgetting” 
upon number of reinforcements is ex- 
hibited in Table 1. 
one or two reinforcements, a sequence 


Following either 


of two unreinforced test trials is given; 
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and retention is measured in terms of 
the proportion of instances in which a 
correct response to a given stimulus on 
the first of these tests is repeated on 
the second test. Results of two such 
experiments are shown in the table. 

The left-hand column of proportions 
represents data from a study conducted 
by the writer with the assistance of 
E. J. Crothers (Estes et al., 1960). 
Twenty Ss were each run on six eight- 
item paired-associate lists under each 
condition (number of reinforcements). 
The stimuli were nonsense syllables, 
and the responses were numbers in half 
the lists and familiar words in the other 
half. The right-hand column presents 
data from the free verbal recall experi- 
ment previously cited (102 Ss each 
tested with a list of eight items under 
each condition). The pattern of test 
proportions needs little comment. In 
each case the difference called for by 
the traditional conception of associative 
strength fails to appear. 

This outcome is a little hard to swal- 
It is well known that retention 
If the ad- 
ditional reinforcements given during the 
overlearning period do not produce the 
increased resistance to forgetting, then 
what does? 


low. 
increases with overlearning. 


We may obtain a clue as 
to the answer by introducing what 
might a priori seem to be very slight 
change in the design of our minimal 
experiment on retention. This variable 
is illustrated in Table 2. In the free 


TABLE 1 


DESIGN AND RESULTS OF MINIMAL EXPERI- 
MENT ON RETENTION AS A FUNCTION 
OF NUMBER OF REINFORCEMENTS 


Amount of Retention 
Procedure P-A FVR 
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TABLE 2 


DESIGN AND RESULTS OF MINIMAL EXPERI- 
MENT ON RETENTION AS A FUNCTION 
oF A Test TRIAL INTERPOSED 
BETWEEN REINFORCEMENTS 


r 


Amount of 


Procedure Retention 


R RT, Ts 
RT, RT: Ts 


verbal recall experiment cited previ- 
ously, we included, in addition to the 
condition shown in the upper row 
(which is the same as in Table 1), a 
condition with the same number of re- 
inforcements but with an additional 
interspersed test trial. The idea was 
to give an opportunity for correct re- 
sponses learned on the first reinforced 
trial to become conditioned to cues 
which are present only on test trials 
and not on training trials—in other 
words to spread the effect of the first 
reinforcement over more stimuli. Re- 
tention is measured in terms of the pro- 
portion of correct responses repeated 
from T, to T, in the upper row and 
from T, to T, in the lower row. In 
contrast to the lack of effect on reten- 
tion produced by increasing the number 
of reinforcements, note the large effect 
of the added test trial. Two reinforce- 
ments without an intervening test yield 
only 52% retention, whereas two rein- 
forcements with an intervening test trial 
yield 78% retention from the first to 
the second test following the second 
reinforcement. 

Although the principle that retention 
increases with number of reinforce- 
ments is exceedingly well established in 
the lore of human learning, we must 
face the possibility that this empirical 
relation, like the classical acquisition 
curve, is an artifact of the confoundings 
inherent in the usual experimental par- 
adigm. 
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CONCLUDING REMARKS 


To recapitulate the box score: we 
have conducted a series of highly sim- 
plified experiments especially designed 
to provide relatively direct evidence for 
the widely accepted interpretation of 
learning and retention in terms of incre- 
ments and decrements in associative 
strength. We noted that in virtually 
all contemporary learning theories, the 
concept of strength is assumed to have 
at least three different empirical man- 
ifestations—the habit hierarchy, the 
growth of associative strength in the 
sense of response probability as a func- 
tion of number of reinforcements, and 
the increase in resistance to forgetting 
of once established associations with 
additional reinforcement. We tested 
for all three of these effects by means 
of the simplest controlled comparisons 
we could arrange, doing our utmost to 
eliminate the confoundings and the lay- 
ers of statistical processing that shield 
the behavioral changes occurring on in- 
dividual learning trials completely from 
view in conventional experimental de- 
And under these presumably 
favorable circumstances, all three ef- 
fects mysteriously evaporated, leaving a 
picture of unitary associations the learn- 
ing and unlearning of which proceed on 
an essentially all-or-none basis.* 


signs. 


8 Several readers of a prepublication draft 
of this article have raised the question 
whether alternative measures, e.g., response 
latencies or recognition scores, might yield 
evidence of learning in cases where the prob- 
abilities of conditioned responses or of cor- 
rect recalls do not. Regardless of the an- 
swer, it is important to note that information 
about concomitant changes in other variables 
would have no logical bearing upon conclu- 
sions pertaining to the one actually chosen 
for analysis in a given experiment. To 
determine whether the behavior of recalling 
the correct response to a paired-associate 
item is learned in all-or-none fashion, we 
required an analysis according to the par- 
adigm of Figure 2 with recall score as the 
dependent variable. To determine whether 
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What is the import of these results 
for the question raised earlier concern- 
ing the strategic soundness of the gen- 
eral methods on which investigators of 
learning chiefly rely? The concept we 
have examined in detail epitomizes the 
intervening-variable paradigm for the- 
ry construction that has been popular- 
ized by Tolman and Hull, and their 
followers, to the point of dominating 
contemporary learning theory. The 
general technique is to postulate a hypo- 
thetical state or entity which is held to 
intervene in some 
served stimulus and response variables. 


Predictions derived from theories built 


around such conStructs are checked 
against data from standard learning ex- 
periments. Thus the adequacy of the 
theories depends in turn on the ade- 
quacy with which essential aspects of 
learning are 


al 


captured by standard ex- 


perimental designs 


I have emphasized the term “stand- 


rd’ for despite the fact, decried by 

critics, that psychologists in this 
all admonitions to 
dize their tasks, situations, and 
they have more than com- 
pensated for this lack by the degree to 
which they have standardized their ex- 
perimental designs. Scanning the ref- 
erence list of, say, Hilgard’s Theories 
f (1956 


lave resisted 
standar 


procedures, 


of Learning ’ one can find no 


t recognition of 


a previously viewed 
learned on an all-or-none basis, we 
wuld need a similar analysis with recogni- 
re as the dependent variable. To 
letermine whether the behavioral change as- 
iated with a decrease in latency is learned 
an all would need a 
nilar analysis with some criterion of change 


latenc as the dependent variable, and 


item 15 


or-none hasis. we 


irvest is even scantier if one con- 
more monographic works, e.g., Tol- 

nan (1932), Hull (1943), or Spence (1955), 
sociated with systems of intervening-var- 
es in behavior theory; or, for that matter 
theoretical writings (Estes, 1959) 


lectic treatise of McGeoch and Irion 
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sense between ob-' 
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more than a meager handful of studies 
which are not molded into a paradigm 
that might be termed the groups-by- 
trials design. The master blueprint re- 
quires the investigator to average some 
type of performance score, e.g., fre- 
quency of conditioned responses or of 
correct responses, over a series of trials 
for groups of Ss—the groups being dif- 
ferentiated on the basis of conditions 
obtaining over the series. Going back 
a few years in the experimental jour- 
nals, one finds the customary output of 
this experimental paradigm to be a set 
of mean performance curves for experi- 
mental and control groups; more re- 
cently the output is typically an array 
of analysis of variance tables, perhaps 
supplemented by some mean criterion 
scores. These observations in them- 
selves do not constitute a criticism. No 
one would gainsay that the groups-by- 
trials design and its associated statis- 
tical techniques are useful tools for 
assaying the effects of various proce- 
dures and conditions upon performance. 
It is easy, however, to overlook the fact 
that the groups-by-trials design yields 
only information about relationships 
and trends which hold on the average 
over groups of Ss and series of trials. 
No accumulation of experiments, how- 
ever large, all conducted and all an- 
alyzed in accord with this same general 
method can provide a sufficient em- 
pirical check on concepts and assump- 
tions that refer to processes or events 
occurring in the individual learner. The 
findings we have considered in this 
paper suggest that in point of fact some 
of the most firmly entrenched concepts 
and principles of learning theory may 
be in a sense artifacts of a conventional- 
ized methodology. 

The laboratory investigator of learn- 


(1952), deviant experimental designs appear 
in references with “ie earliest dates but dis- 
appear as one comes down to the contempo- 
rary literature 
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ing is used to going his own way more 
or less oblivious to the rattle of criti- 
cisms from textbook writers, educators, 
and other “outsiders.” Traditionally 
the criticisms have always been to the 
same effect—that the “learning experi- 
mentalist” should push on from the 
worn out fields of conditioning and 
simple verbal learning, where there are 
really no unsolved problems of any im- 
portance remaining, and devote himself 
to richer, more complex, experiments 
that come closer to learning situations 
of real life (as contrasted with the un- 
real life found in the laboratory). Here 
we break with tradition, for the crit- 
icism generated by our work within the 
field suggests that progress toward a 
satisfactory theory of learning requires, 
not more complex, but simpler experi- 
ments. The conventional experiment, 
far from being oversimplified, repre- 
sents such a complex and intricate con- 
founding of stimulus and response var- 
iables over trials that once it has been 
done and reported in conventional form, 
no amount of study of the analysis of 
variance tables and Vincentized per- 
formance curves can discloge,the effects 
exerted by specific causal variables on 
individual Ss upon particular oc<asions. 
Concepts depending solely upon the 
conventional experiment for support 
may turn out to belong, not to psy- 
chology of learning, but only to-a psy- 
chology of the criterion score, the mean 
performance curve, and the groups-by- 
trials design. 

In concluding, I would like to indi- 
cate that I do not mean to offer the 
experimental findings reported in this 
paper as a crucial test of incremental 
vs. all-or-none theory. What does seem 
clear with respect to the former is that 
the kind of evidence heretofore adduced 
in support of incremental theories is 
inadequate to distinguish them from al- 
ternative conceptions. More penetrat- 
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ing experimental analyses are required. 
We have made a start in this direction, 
and from the early returns it appears 
that no extant theory of the incremental 
type can handle the pattern of results 
that is emerging. The temptation is 
great to indulge now in a bit of specula- 
tion as to whether the answer to this 
situation will prove to be a remodeling 
of one of the familiar incremental the- 
ories, or perhaps a quite different the- 
ory based on all-or-none assumptions— 
but this is a step I do not intend to take. 

Since the earliest days of association- 
ism, overdependence upon speculation 
and circuitous inference has impeded 
the interplay of theory with experiment. 
While I would not for a moment de- 
preciate the role of imagination in sci- 
ence, I suspect that it will begin to 
serve us effectively in learning theory 
only as we begin to accumulate reliable 
determinations of the effects of single 
variables upon single learning trials in 
individual organisms. If by continual 
simplification of our experimental anal- 
yses and refinement of our mensura- 
tional procedures we can achieve these 
determinations, we may find that the 
long sought laws of association may 
be not merely “instigated,” or even 
“suggested,” but literally dictated in 
form by empirical data. 
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The learning of avoidance, as Solo- 
mon and Brush (1956) have re- 
marked, is something that we all take 
for granted. But, and they have put 
it particularly well, the 
failure of Ss to learn to avoid are not rare, 
but they are less apt to be reported in scien- 
tific papers than are the successes. And the 
conditions surrounding failures to learn to 
avoid are as instructive to the investigator, 
in many cases, as are the optimum conditions 
for producing the avoidance phenomenon. 


It is in this spirit that we offer 
certain data failures to 
avoid. These collected in 


concerning 


have been 


both formal and informal studies that 
have now occupied the better part of 


All have been con- 
within the 
y one or another 
leading brand of commercial lever- 
pressing apparatus. 


some two years. 
ducted rats, and all 
format provided by 


with 


Some slight dif- 
ferences have been observed between 
the different brands, but we can con- 
clude at the outset that the problems 
herein described are not unique to any 
Rather, the difficulties seem 
to be inherent in the lever-pressing 
situation, and not in different 
stellations as provided by the different 
fabricators. 


variant. 


con 


Tue Basic PROBLEM 
We must emphasize that such equip- 
rats to 


lever 


ment can be used in training 
1 


make 


press avoidance responses 


stimulus-discriminate 
Our con 
‘This research was supported in part by 
a grant from the National Institutes of 
Mental Health. 
2From the Laboratory 
and Physiological 


of Comparative 
Psychology 


cern is wholly with the rate at which 
such learning is established. We had 
first obtained this apparatus for some 
studies of the functions of the visual 
cortex; these investigations had been 
p'anned as extensions of the work of 
Meyer, and Maher (1958). 
The bar-press situation was selected 
wanted to examine two 
responses that would be the same ex- 
cept for the incentive under which 
their learning was established. Fur- 
ther limitations thus imposed were 
two: that light be the warning stim- 
and that the habit be learnable 
within a single day. These did not, 
at first, appear to be unreasonable re- 
quirements. The primitive device that 
we had used before had worked well 
enough to give us learning in very 
rarely more than 100 light-shock pres- 
entations. It was our presumption 
that the lever-pressing habit could not 
be established quite so quickly, but we 
readily accepted this 
convenience that 
afforded. We were not, however, pre- 
pared in any for our 
pirical encounter, during which a rat 
failed to show reliable avoidance after 
several thousand trials. 

A few such experiences led us to 
conclude that we 
thing very wrong; it did not seem pos- 
sible that lever-press 
could be as bad as this for the purpose 
However, the first of our inquiries to 
with the 
situation did not prove to be encourag 


Isaac, 


because we 


ulus, 


because of the 
the new equipment 


way first em 


were doing some- 


conditioning 


persons familiar bar-press 


ing: we learned, for example, that it 
often takes as much as 40 hours to 
establish stable levels of performance 
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Somewhat salubrious estimates 
were nearer to 10, but it seemed that 
not all rats could be counted on even 
then. This perhaps should not have 
shocked us, but it did—for we were 
remembering with sorrow that Mote 
(1940) had trained his average rat to 
turn a wheel within 50 light-shock 
presentations. The figure of merit for 
the lever-press response is, by this 
comparison, so poor that only in an 
automated era could it possibly survive 
as an indicant of choice. 

In retrospect, so far as our brain 
work was concerned, our course of 
action at this point should have been 
to build a Brogden-Culler apparatus 
and give up lever pressing forthwith. 
But it is difficult to turn one’s 
back to a gleaming, thousand-dollar 
box: then too, we were still convinced 
that some combination of factors would 
produce efficient learning. Therefore 
we proceeded to assess the most ob- 
vious of these in an informal manner; 
bar weights, shock strengths, and light 
first con- 


more 


most 


intensities 


] 


siderations. 


were 


among the 
Throughout these experi 
ments, the general conditions were all 
very much the same: the warning stim- 
ulus consisted of a sharp illumination 


of the box; the box was darkened 
when the bar was pressed, and this re- 
sponse terminated shock on trials when 
the shock was not avoided. 

There was never any problem what- 
soever in training rats to make escape 
indeed, after relatively few 
shaping trials than 30 
or 40—the trials 
were so short that it could be doubted 
that the shocks received were noxious. 
Frequently the rat learn to 
10ver just above the locus of the bar; 
this position was maintained through- 
out the presentation of the warning 
light. When the shock arrived, the 
rat depressed the bar very quickly, but 
in most instances it failed to learn to 


Te spt mses: 
rarely 
latencies on 


more 
most 


would 


do this during the warning interval. 
To counter this maneuver of “avoid- 
ance by escape” we next arranged a 
program such that shocks could al- 
ways be avoided, but that periodically 
were of a fixed duration rather than 
escapable. This of all the variables 
informally considered seemed to make 
at least some difference. 


Two ILLUSTRATIVE EXPERIMENTS 


Two formal studies serve to illus- 
trate just how difficult this problem is. 
We had hoped, because of certain 
problems of control, to train at least 
three rats a day; practically, the most 
that we could possibly invest in train- 
ing was a 4-hour session. We were 
therefore driven to consider massing 
as a possibility and hence trained 
three groups of nine rats each with 
intervals between trials of 20, 40, and 
60 seconds, respectively. Each trial 
consisted of light presentation for a 5- 
second period; pressing of the bar 
turned off the light and stopped or 
prevented the occurrence of a shock of 
1.6 milliamperes. The outcome, now 
so typical of many, was that the animals 
learned virtually nothing during their 
first training sessions, and this despite 
the fact that they had first been shaped 
to make escape responses and then had 
been presented with from 240 to 720 
light-shock pairings. 

Some small evidence of avoidance 
learning did appear when training was 
continued for a second daily session. 
The intermittent nonescape procedure 
then was instituted, and the rats were 
trained for three more days. But the 
final outcome, based upon a total of 
from 1,200 to 3,600 presentations, was 
that the 20- and 40-second groups still 
were not avoiding properly. For these 
groups, in fact, the median percent 
avoidance were .59 and 1.32. 
The 60-second group was somewhat 


scores 
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better ; here at least some animals per- 
formed beyond a level of 50% avoid- 
ances. Only four, however, did so, 
and the median performance for the 
group was mere 6.69%. 

At this juncture, we began to specu- 
late that the problem might be in 
duration of the warning stimulus. This 
has been manipulated widely in dif- 
ferent experimental contexts; Warner 
(1932), for example, examined a range 
of from 1 to 30 seconds in a barrier 
situation. He found that learning was 
progressively faster as the warning 
interval decreased, but did not examine 
the range below 10 seconds in detail. 
In most relatively recent studies, these 
have been the intervals of interest, and 
5 seconds seemingly is most popular. 
Mote, on the other hand, presented 
light for just 2 seconds, and this in- 
terval was used successfully by Meyer, 
Isaac, and Maher. While we could 
not guess what the optimum would be 
in lever-pressing situations, it appeared 
to us that the best place to look was in 
the range below 5 seconds. 

For this experiment, the presenta- 
tions were given at a rate of one a 
minute; four groups of rats, 15 in a 
group, were trained for a total of six 
There were two daily training 

each of three 
hours long; there were thus 180 pair- 
ings per day and 360 total pairings. 
We must say, perhaps, that these are 
few conditioning 
standards ; number is 
seven times as large as that required 
for learning in a wheel. 

The differential treatments were 
confined to the first training session. 
Warning stimulus durations prior to 
shock were, for the four respective 
and 2.5 
During the second session, this was 
fixed for all at a value of 5 seconds. 
This was done so that the superiority 
the differential treatments 


hours. 


sessions, which was 


trials by 
however, the 


operant 


groups, 0, .5, 1.5 seconds. 


of one of 
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might be evidenced without the prej- 
udice of changing latency requirements. 
Here again, as in all the studies we 
have done, lever-pressing terminated 
light ; if the animal did not avoid, light 
and shock went off together. 

These procedures yielded 
performances approximating 20% ; 
variations for the different intervals 
proved to be no more than 5%. But 
even these performances could not be 
taken as evidence for some avoidance 
learning; associated with them were 
bar-pressing rates that often reached 
extremely high values. There were 
many instances, in fact, of rates above 
two thousand per hour; had these 
rates been evenly distributed in time, 
at least a third of all the subjects 
would have shown us ultimately per- 
fect avoidance. We obtained, instead, 
a suppression of the rate when the 
warning stimulus came on; and this 
suppression was evidenced throughout 
the conduct of the study. 


median 


INTERPRETATION 


Suppression, in itself, is not sur- 
prising. It is, indeed, the essence of 
the readily established CER of Hunt 
and Brady (1951); we expect it and 
we almost always find it regardless of 
our test technique. The question, 
rather, is why such suppression per- 
sists in lever-pressing situations, when 
its existence presumably implies a 
drive lever pressing reduces. 
If light produces “fear,” and shock 
produces pressing, and pressing elimi- 
nates the light, how by any concept of 
avoidance learning can we fail to get 


which 


avoidance ? 

We have found but one asylum, and 
that is in supposing that the offset of 
light is not an effective “fear-reducer.” 
Having done so we can only wonder 
why we had expected it to be; in our- 
selves, we know that “fears” persist, 
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once evoked, far beyond their instiga 
tors. We do not intend this statement 
as a glove to Mowrer and Lamoreaux 
(1951) however; all that they have 
said is that termination ‘will have the 
effect of reducing the rat’s fear at least 
a little and will thus reinforce, at least 
mildly * the response which brings 
about the termination. Some rein- 
forcement is undoubtedly produced 
and we would merely be inclined to 
substitute the “trivially” for 
“mildly.” 

But why is the particular form of 
Most ex- 
perimenters with whom we have dis- 


word 


the response so important ? 


cussed this problem have considered it 
to be a matter of the relative availa- 
bility of the response to be conditioned. 
However, it is difficult to argue that 
responses sometimes made as many as 
20,000 times are lacking in availability. 
We believe, instead, that response to- 
pography is of consequence for other 
reasons and that we can best begin ta 
by first reviewing 
the 


show that this is so 
Schoenfeld’s (1950) 
process of avoidance. 


While recognizing that a light, such 


concept of 


as we have used, becomes aversive 


after noxious pairings, Schoenfeld has 
emphasized the role of proprioception 
as linking avoidance to escape. In a 
sense, his view is that the rat, in avoid 
ing, is escaping from its own response 


Termination 


to the warning stimulus. 


of the proprioceptive feedback is a 
reinforcing operation, as is termination 
of the now-aversive warning stimulus 
itself We believe that this approach 
is basically correct from the standpoint 
of its emphasis and that we can extend 
it to interpret the fact that apparatus 
can be ordered in efficiency for avoid- 
ance learning from wheel to shuttlebox 
to lever. 

Briefly, the this. If 
the termination of a proprioceptive 
reinforcing function, 


argument is 


pattern has a 


promptness and completeness will de- 
termine effective this will be. 
Promptness and completeness will 
themselves depend upon our getting 
rid of the responses whose presence 
at the moment has yielded the propri- 
pattern to be terminated. 
will, of dis- 
appear upon removal of related stimuli, 
but others will persist unless inhibited 
through evocation of antagonists. The 
freezing-crouching pattern, or condi- 
tioned “fear,” is a member of this lat- 
ter group; thus the termination of its 
proprioceptive pattern will require re- 
sponse competition. 

Since the freezing-crouching pattern 
is a posture, suitable antagonists are 
few; in specifying these, we think we 
specify the quickly learned avoidance 


how 


oceptive 


Some responses course, 


responses. These responses must, in 
scope, be commensurate with the pat- 
tern of conditioned fear, for when this 
is only partially the case the inhibition 
The lever-press 
response thus is learned but slowly be- 


cannot be complete. 


cause it is a poor antagonist; its oc- 
currence can be quite compatible with 
persistence of the prior pattern. But 
obviously different 
with a jump or running response; 
these yield us competition, termination 
and rapid avoidance learning. 
Mowrer (1940), twenty years ago, 
remarked in passing that he had had 
no luck when he had tried to establish 
evidence of good avoidance learning 


the situation is 


with a “type of pedal-pressing re- 
With his conclusion we must 
and we think our tests 
exhaustive. However in- 
the general implication, 
operants are not arbitrary; in avoid- 
ance learning, their selection is per- 
haps the most important of considera- 
Possibly, a standard box ‘can 
be so modified that suitable responses 
will appear; we ourselves are trying 


sponse.” 
heartily agree, 
have been 
convenient 


tions. 


this in a minor way with a “hit-the- 
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pattern. Whether this will 
work as well as a shuttlebox is some 
thing that seen, but 
never again will we regard such situa- 
tions as outmoded and inelegant sur- 


ceiling” 


remains to be 


vivors of an era best forgotten. 
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There has been a growing research 
interest in originality during the last 
decade. Areas of major concern have 
been the study of the productivity of 
creative individuals (Dennis, 1958; 
Lehman, 1953; Taylor, 1956, 1958; 
Van Zelst & Kerr, 1951), their per- 
sonality (Barron, 1955; Drevdahl, 
1956; Taylor, 1956, 1958), the socio- 
logical and cultural factors promoting 
creativity (Barnett, 1953; Stein, 1953; 
Taylor, 1956, 1958), and investigations 
designed to develop reliable paper and 
pencil tests for the study of individual 
differences (Guilford, 1950; Spring- 
bett, Dark, & Clarke, 1957; Wilson, 
Guilford, & Christensen, 1953).2 — 

Although there is general recogni- 
tion of the importance of the problem 
of devising training techniques for fa- 
cilitating creativity, most of the work 
in this important area has been anec- 
dotal or merely hortatory (Ghiselin, 
1955: Mearns, 1958; Osborn, 1957; 
Slosson & Downey, 1922). 


Scope oF THIS PAPER 


The purpose of this paper is to re- 
view the experimental research that 
may be relevant to the problem of de- 
vising techniques for increasing origi- 
nality and to indicate the behavioral 
principles possibly involved in the pro- 
duction of the desired effect. 


1 This work was supported by contract 
Nonr 233(50) between the Office of Naval 
Research and the University of California. 
Reproduction in whole or in part is per 
mitted for any purpose of the United States 
Government 

2 These are only selected references and 
are not intended as an exhaustive survey of 
the relevant literature. A more complete 
list of references on originality may be found 
in Taylor (1956, 1958). 


For purposes of further discussion 
we shall distinguish between original- 
ity and creativity. Originality, or orig- 
inal thinking, as we shall use the term, 
refers to behavior which occurs rela- 
tively infrequently, is uncommon un- 
der given conditions, and is relevant 
to those conditions. Criteria of rele- 
vance and uncommonness, of course, 
must be established for any given situ- 
ation. This is a problem more readily 
solvable within the laboratory than 
without. Creativity, according to the 
present usage, refers to products of 
such behavior and the reactions of 
other members of a society to those 
products. An invention is a creative 
product that may have an important 
effect upon society and is a conse- 
quence of original behavior. But con- 
siderably more variables enter into the 
determination of creative works than 
originality alone. Our distinction im- 
plies that an individual may be highly 
original but not creative. His brilliant 
theories are never published, or they 
are not implemented by the necessary 
research. Perhaps the original re- 
search is conducted, but the results are 
not published or patented. All this 
may be done, but the work may be 
overlooked or the implications disre- 
garded by the appropriate community 
of scholars or society at large. Fur- 
ther common illustrations may be 
given, indicating that a great many 
more behavioral and societal variables 
influence creativity than originality, 
making the study of originality under 
simplified laboratory conditions more 
feasible than that of creativity. Many 
of these variables may affect original 
behavior as well, but they are not of 


799 





230 


such overwhelming importance when 
the training of originality is consid- 
ered. Our subsequent discussion 
therefore will be concerned solely with 
originality as here defined. One fur- 
ther distinction in the experimental 
study of original behavior needs to be 
made. Such behavior is always rela- 
tive, either to a given individual’s past 
behavior or to the norms of a popula- 
tion of which he is a member (Barron, 
1955). The two are no doubt corre- 
lated. But it is the latter which is 
related to creative works; and since all 
of the pertinent experimental research 
has been of this kind, we shall be con- 
cerned only with the study of origi- 
nality relative to some group norm. 
Guilford (1959) also distinguishes 
between originality and creativity. 
However, the basis for the distinction 
from his factor analytic ap- 
Originality, defined essen- 
tially in the same fashion as in the 
several be- 


stems 

proach. 
present paper, is one of 
havioral traits contributing to creativ- 
ity. The latter is a more general be- 
havioral trait including as components 
several forms of flexibility, fluency, 
and motivational and temperamental 
traits in addition to originality. Al- 
though this is a reasonable empirical 
classification of traits based upon fac- 
tor analysis, it does not necessarily 
contradict the We do 
not, however, accept Guilford’s addi- 
tional assertion that an S-R approach 
cannot deal adequately with creativity. 


present one. 


ACCOUNTS OF ORIGINALITY 


TRAINING 


EARLIER 


\lthough psychologists have done 
relatively little experimental research 
training of 
psychologists have 
with the problem for 
What is more, their suggestions, al- 
though not supported by 
meeting the usual scientific standards, 


on the originality, non 


been concerned 
some time 


evidence 
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are frequently in accord with com- 
monly accepted behavioral principles. 

For example, Mearns (1958) in a 
book originally published in 1929 re- 
peatedly emphasizes that in order to 
facilitate the originality of school chil- 
dren in the arts the teacher must re- 
inforce, manifestly approve, his orig- 
inal efforts. He illustrates many 
techniques for inducing the child to 
make public his efforts, to gain the 
confidence of children so that com- 
munication will occur with their “sec- 
ret unexpressed selves.”” Furthermore, 
the teacher is advised to approve of 
only the genuinely original effort. The 
teacher is further cautioned to 
patiently for the appearance of original 
behavior which is fostered by a “per- 
missive atmosphere,” the absence of 
“drill” and excessive discipline. Ac- 


wait 


cording to Mearns, original behavior 
appears eventually because all normal 
children have 
pulse to be creative. 

Aside from the romantic approach 
to a problem of behavior, we would 
agree that the way to foster originality 
is to reinforce such behavior when it 
occurs. <A basic difficulty is that it 
may not occur at all or at such in- 
frequent intervals that the reinforce- 
ments cannot sii "e up such behavior. 
Thus the fund. ..ntal problem in the 
training ; devise 
methods for occurrence 
in diverse sit 1s get it to 
occur in the fir cby per- 
mitting the oper ‘ement. 
If the operant | 
conditionin: vill not be ef- 
fective, training cannot occur. W< 
assuming, then, that originality can be 


an urge, energy, or im- 


of iginarty is to 


iginality is 
too le yw, 


are 


learned and that the same principles 
of conditioning hold as in other forms 
of operant behavior. 
it can be reinforced, approved, recog- 
It will then show an increased 
An appar- 


Once it occurs 


nized. 
probability of occurrence. 





ON THI 
ent difficulty, however, is that origi- 
nality is manifestly different from 
other behaviors. How can the rein- 
forcement of one bit of uncommon 
behavior increase the frequency of 
other uncommon behaviors which by 
definition are different. Despite the 
apparent paradox, this sort of non- 
specific transfer does occur, as we 
shall see. 

Royce (1898), as far as we know, 
was the first to experimentally attack 
the problem of devising techniques for 
facilitating the occurrence of original- 
ity. He employed two different pro- 
cedures. First, Ss were asked to draw 
a series of nonsense figures unlike any- 
thing they had ever seen. A second 
procedure was to present a series of 
drawings to S with instructions to 
draw figures as different as possible 
from each of the models. Examination 
of the qualitative changes that oc- 
curred under conditions led 
Royce to conclude that these proce- 
dures 


these 


may facilitate originality and 


that they correspond to conditions in 


SOK iety that are conducive to original- 
ity. 


Slosson and Downey (1922) suggest 
another method for training and test- 
ing originality in writing fiction. They 
suggest writing a plot, characters, or 
an entire story, based upon the more 
unusual found in the per- 
sonal columns of newspapers. Head- 


training 


messages 


lines are also proposed as 


materials. Variations are possible in 
the use of the materials by placing 
restrictions on the kind of plot or 
characters to be invented. Thus, given 
an unusual message, the writer is 
forced to practice literary invention. 
The assumption is that this form of 
“mental gymnastics” will increase the 
originality of his writing. 

A simple test of originality sug- 
gested by Slosson and Downey (1922) 
is to how different 


determine many 
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plots can be written to the same mes- 
sage, and for how many messages can 
plots be written in a given time in- 
terval. A more formalized test is also 
presented, but no empirical data are 
given from its use. Likewise, no data 
are presented on the effectiveness of 
the training method which appears to 
have been employed as a teaching tech- 
nique in creative writing courses at 
the college level. 

Osborn (1957) also believes that 
originality can be trained and that 
practice in producing original “ideas” 
will further develop originality. He 
gives many exercises designed to pro- 
vide such training. They typically 
consist of various kinds of problem 
situations for which as many different 
solutions as possible are to be offered. 
The quantity of different “ideas” is 
stressed, since it is believed that the 
likelihood of original “ideas” increases 
with increases in quantity. 

Osborn (1957) initiated 
a procedure of “group ideation” known 
as “brainstorming” which is designed 
to evoke original and which 
has received considerable attention 
from the public press. It is a modified 
group free association procedure with 
emphasis upon producing large num- 
bers of “ideas” in the absence of criti- 
cism or judgments as to their value. 
Critical evaluation of “brainstorming” 
sessions occurs later in order to avoid 
during the 


has also 


“ideas,” 


inhibiting unusual “ideas” 
session. 

Although there have been courses 
established to teach “brainstorming” 
in business and industrial firms as well 
as colleges, there is as yet little ob- 
jective research demonstrating its 
efficacy. 

Taylor, Berry, and Block (1957) 
found that group. brainstorming did 
not produce a greater number of 
“ideas” or unique “ideas” as compared 


to individuals working alone and 
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whose performance was scored as a 
nominal group. In fact, it was found 
that the nominal groups were 
nificantly superior to the brainstorm- 
ing groups under the conditions of 
their experiment. Corroborative evi- 
dence was found in another study by 
Taylor and Block (1957). These ex- 
periments, however, were not designed 
to study the effects of originality train- 
ing in groups or by individuals. They 
provide no information on the problem 
of training originality, whether giving 
many different solutions to problems 
results in an increased probability of 
giving original solutions to subse- 
quent different problems. 

Meadows and Parnes (1959) have 
recently reported that a course based 
upon the principles described in Os- 
born’s book (1957) produced a sig- 
nificant increment in originality in its 
students as compared to a control class 
taking a different course. But as in 
many classroom experiments, there is 
the difficulty of assessing what the 
relevant variables are, whether it was 
the training in problem solving per 
se, or the changes in motivation result- 
ing from participating in a class and 
studying a procedure which explicitly 
claims to facilitate problem solving. 
There are additional difficulties inher- 
ent in the study since no attempt was 
made to control or assess the extent of 
differential rehearsal of the test mate- 
rials in experimental and _ control 
The experimental class prac- 
ticed on materials very similar to the 
originality tests given before and after 
the courses were taken. Finally, the 
design employed permits a_ biased 
sampling of Ss, since the “creative 
problem solving” course was an elec- 
tive one. 

The early psychological literature as 
well as the suggestions of the non- 
psychologists reviewed tend to agree 
upon a small number of different pro- 


sig- 


classes. 
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cedures for increasing originality. 
One training procedure is to present 
an uncommon stimulus situation, a sit- 
uation for which common or conven- 
tional responses may not be readily 
available. Relatively uncommon re- 
sponses may be evoked as a conse- 
quence (Slosson & Downey, 1922; Os- 
born, 1957). A second procedure is 
the evocation of different responses to 
the same stimulus situation (Royce, 
1898; Osborn, 1957). Under such 
conditions the successive responses 
may become more uncommon. A third 
training procedure is the evocation of 
the uncommon responses as textual 
responses. Several recent experiments 
have employed this procedure (Jud- 
son, Cofer, & Gelfand, 1956; Maltz- 
man, Brooks, Bogartz, & Summers, 
1958; Maltzman, Simon, Raskin, & 
Licht, 1960) and will be discussed 
shortly. Instructions, as distinguished 
from training, may also be used to in- 
crease originality, and appear to be 
effective under certain conditions 
(Christensen, Guilford, & Wilson, 
1957; Maltzman, Bogartz, & Breger, 
1958; Royce, 1898). 


“PRODUCTIVE” THINKING 
ORIGINALITY 


AND 


Before turning to the current ex- 
perimental research on _ originality 
training, however, note should be taken 
of several recent studies that have em- 
ployed Maier’s (1931) two-string or 
analogous problems as a test situation. 
The reason for commenting upon these 
studies is the assumption that there 
is no fundamental difference in the be- 
havioral principles determining origi- 
nality and problem solving behavior 
generally. Both involve the evocation 
of relatively uncommon responses, 
otherwise the situation would not be 
called a problem or the behavior orig- 
inal. This is especially apparent in 
connection with Maier’s two-string 
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Maier (1931) considers the 
pendulum solution to be a consequence 
of productive thinking whereas other 
solutions represent reproductive think- 
ing. As indicated elsewhere (Maltz- 
man, 1955) these problem solving be- 
haviors do not involve fundamentally 
different laws as demanded by Maier, 
but only a difference in the extent to 
which initial response hierarchies are 
modified. Furthermore, as far as can 
be determined this also is the basis 
that Maier used for distinguishing re- 
productive from productive solutions. 
The former solutions were given by 
his Ss upon the initial presentation of 
the problem. Only after these solu- 
tions were given and the Ss were told 
to find still another solution, which 
happened to be the pendulum solution, 
did Maier call it productive thinking. 
The empirical basis for the distinction 


problem. 


between reproductive and productive 
thinking, then, was the probability of 
solution 


occurrence of the under his 


experimental conditions. The pendu 
lum solution was relatively uncommon 
and is original in this sense. 

Several studies have attempted to 
determine, with some success, whether 
verbal responses necessary for the solu- 
tion of a subsequent problem are pres- 
ent in the initial response hierarchy 
of Ss (Saugstad, 1955; Saugstad & 
Raaheim, 1957; Staats, 1957). The 
determination is made by obtaining 
word associations to objects later 
found in the problem situation. Re- 
sponses necessary for the solution of 
the problem are presumably uncom- 
mon, have a low probability of occur- 
rence in the initial response hierarchy. 
An association procedure has also been 
used to determine the uncommonness 
of responses evoked in the initial re- 
sponse hierarchy and their relation to 
subsequent success in the two-string 
problem (Maltzman, Brooks, Bogartz, 
& Summers, 1958). 
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A series of experiments by Judson, 
Cofer, and Gelfand (1956) is a limit- 
ing case of the use of textual responses 
in facilitating “productive” problem 
solving or originality. They employed 
the two-string problem as the test and 
rote learning as the training situation. 
Instead of attempting to increase the 
occurrence of uncommon responses 
during training and determining their 
transfer effects in the test situation, 
they induced directly the response 
chain necessary for a pendulum solu- 
tion to the problem. A group form 
of the two-string problem was em- 
ployed, and the Ss were required to 
write as many solutions as possible to 
the problem of grasping both strings 
simultaneously. The design of the ex- 
periment was to prompt a pendulum 
solution to the problem by presenting 
an experimental group with different 
lists of words, one of which contained 
the response chain “rope,” 
. “pendulum.” Additional con- 
trol groups were employed receiving 
word lists including none of these 
words, one or all three but each in a 
different list. In this study as well as 
in two replications, the experimental 
group supcrior to the control 
groups, although the differences were 
not always statistically significant. A 
further qualification is that these re- 
sults hold only for men. No reliable 
differences obtained among 
women in the various conditions. An 
analogous experiment was conducted 
employing a group form of Maier’s 
hat rack problem, and comparable re- 
obtained. These studies 
suggest that repeated evocation of the 
critical words during rote learning in- 
creased the probability of their occur- 
rence in the problem situation which 
in turn lead to an increased frequency 
of the desired problem solution. 

Another related study has been re- 
ported by Flavell, Cooper, and Lois- 


relevant 
‘swing,”’ 


was 


were 


sults were 
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elle (1958). Their study was designed 
to demonstrate that “functional fixed- 
ness” could be reduced by evoking un- 
common responses to an object during 
a pretest period. In one respect it is 
more pertinent to the preblem of pro- 
ducing relatively nonspecific transfer 
than the studies by Judson, Cofer, and 
Gelfand (1956), because the responses 
evoked during the pretest period were 
not the appropriate ones in the subse- 
quent test problem. However, for our 
purposes the experiment suffers the 
limitation that it does not show an ac- 
tual facilitation of problem 
performance but only a reduction in 
“functional fixedness.” 

A series of studies by Maltzman, 
Brooks, 3ogartz, and Summers 
(1958) combines features of both of 
these experiments (Judson et al., 
1956; Flavell et al., 1958) by evoking 
uncommon textual responses during a 
pretest period which are not involved 
in the problem solution in any obvious 
manner. Evidence of facilitation was 
obtained under these conditions. A 
screwdriver and a block of balsa wood 
were the objects available for use in 
the problem Only the 
screwdriver was sufficiently heavy to 
a string 

In the 


solving 


situation. 
serve as a weight permitting 
to swing in pendulum fashion. 
first experiment a control group was 
given the test problem without any 
pretraining. Lists of unusual uses for 
a screwdriver, balsa wood, and string 
were read by the experimental group 
before they performed in the problen 
situation. None of the uses referred 
to operations involved in the experi 
ment. It wa. ievertheless found that 
the experimental solved the 
problem significantly faster than the 


group 


control group, and these results held 
for both sex Additional ex- 
with the 
lists of unusual uses also obtained re- 
liable differences between experimental 


Jsroups. 


periments modifications in 
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and control Ss, but typically in the 
case of women and not the men. 
Many variables in this experimental 
situation are in need of further analy- 
sis. However, the inherent lack of re- 
liability of the two-string problem as 
a test and the difficulties involved in 
obtaining experimental control over 
the relevant variables in the situation 
suggest that more fruitful studies of 
originality and problem solving gen- 
erally should employ another type of 
experimental situation. Also, with re- 
spect to the study of originality, test 
problems of this sort usually involve 
uncommonness of response only by in- 
ference or by definition. A more di- 
rect measure of originality would be 
highly desirable. 

CURRENT ST! ORIGINALITY 
/TRAINING 


DIES OF 


Because they seemed to meet the 
desired requirements, tests of original- 
ity of the kind developed by Guilford 
(1950) and his (Wilson, 
Guilford, & Christensen, 1953) as well 
as earlier writers (Hargreaves, 1927; 
Slosson & 1922) 


for a more detailed experi- 


associates 


Downey, were 


chosen 


mental analysis of variables involved 
i Maltzman 
1960). 


In the first experiment in this series 


in training originality by 
and his associates (1958, 


(Maltzman, Bogartz, & Breger, 1958) 
the Ss were run individually. All of 
the subsequent studies were conducted 
A modified free asso- 
employed in 
stimulus were se- 
basis of their restricted 


in group form. 
ciation procedure 
which the 
lected on the 


was 
words 
response hierarchies. Since a rela- 
tively small number of different re- 
sponses are evoked by the stimuli, a 
high degree of communality occurs 
among responses. A_ control 
group initial list of 25 
words to which they gave free asso- 
a different final list of 25 


their 
received an 


ciations and 





ON 


words to which they gave free asso- 
ciations. This was followed by the 
Unusual Uses Test of originality 
(Guilford, 1950). The latter test con- 
sists of a series of names of six com- 
mon objects for which S must give 
different uses other than their common 
everyday use. 

An experimental group received the 
same treatment except that the initial 
list was presented five additional times 
with instructions to give a different re- 
sponse on each repetition. A second 
experimental group was treated in the 
same fashion except that EF said 
“good” after every approximately 
fifth uncommon response during train- 
ing. Each of these three groups was 
subdivided prior to the introduction of 
the free association test list. Half the 
Ss in each condition were instructed 
to try to be as original as possible, and 
half were told nothing. Each of these 
subgroups were further 
prior to administration of the Unusual 
Uses Test and instructed to be as 
original as possible. These latter in- 
structions may have been redundant, 
if not confusing, since the written in- 
structions to the test already implied 
as much. 

Originality measures were obtained 
for performance on the free associa- 
tion list by determining the frequency 
with which the responses to each stim- 
ulus word occurred. The mean fre- 
quency of his associative responses was 
taken as each S’s originality score. 
Thus the lower the score the more 
uncommon are S’s responses on the 
average. The total number of unique 
responses given to the items was taken 
as the measure of originality on the 
Unusual Uses Test, where a unique re- 
sponse was defined as a use occurring 
only once in the sample. Results of 
the experiment showed that both train- 
ing and instructions produced a sig- 
nificant increase in originality on the 


subdivided 
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free association test list. What is 
more, the experimental subgroups re- 
ceiving training in addition to instruc- 
tions were significantly more original 
than the control group receiving in- 
structions alone, indicating that the 
training may produce its effects inde- 
pendently of instructions to be original. 
Results obtained from the Unusual 
Uses Test were ambiguous, yielding 
a significant triple order interaction 
where the experimental subgroup 
without originality instructions was 
significantly more original than its con- 
trol group. However, results on the 
Unusual Uses Test from subsequent 
experiments have consistently shown 
reliable training effects. 

Despite the above evidence indicat- 
ing that the training procedure facili- 
tated originality, additional informa- 
tion is needed to determine precisely 
what characteristics of the procedure 
are responsible for the effect. Differ- 
ences between the experimental and 
control groups may have been due to 
the difference in number of stimulus 
words presented, number of responses 
evoked, number of different responses 


evoked, etc. The fact that different 


responses were evoked by the same 


stimuli may be neither necessary nor 
sufficient for an_ effective training 
method. Additional research was 
therefore conducted in an attempt to 
isolate the relevant variables. 

In one study (Maltzman et al., 
1960) the control condition and the 
experimental condition without verbal 
reinforcement or instructions to be 
original were again employed. The 
latter group received instructions to 
give a different response to each pres- 
entation of the same stimulus word. 
For convenience they will be called the 
standard control and experimental 
conditions, respectively. Two addi- 
tional experimental groups received 
lists of 125 different stimulus words, 
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the same number as the repetitions of 
the initial list for the standard experi- 
mental group. One group received 
high frequency count stimulus words 
while the other group was presented 
with relatively low frequency count 
words (Thorndike & Lorge, 1944) 
This procedure equated the experi- 
mental groups for the total number of 
different responses. They differed in 
that the responses all occurred to dif- 
ferent stimulus words in the 
perimental conditions whereas differ- 
occurred to the 
stimulus words in the standard experi- 
mental condition. Another control 
group was added which received the 
same number of repetitions of the ini- 
tial list as did the standard experi- 
mental group, but these Ss were in- 
structed to try to give the same re- 
sponse at each repetition of 
stimulus. This would aid 


new ex 


ent responses same 


a given 
condition 


in determining whether the number of 


responses per se, or the number of lif- 
ferent responses, is the relevant vari 
able influencing test performance. 
Results from the free 
test of originality showed that the three 


association 


experimental groups did not differ sig- 
nificantly from each other and the two 
control groups did not differ signifi- 
cantly. But each of the experimental 
groups was significantly more original 
than each of the control groups. On 
the Unusual Uses Test the standard 
experimental group did_ significantly 
better than all the other groups. Thx 
two experimental groups receiving 
different stimulus words during train- 
ing did not differ from the standard 
control group, while the new control 
condition was significantly less original 
than each of the other conditions. 
The results from this experiment in- 
dicate that under some conditions the 
evocation of a relatively large number 
of different responses to different stim- 
uli may increase originality to the same 
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extent as the method of repeated evo- 
cation of different responses to the 
same stimuli. However, the two 
methods apparently differ in the ex- 
tent to which they produce transfer 
effects, as indicated by the significant 
difference between the two methods on 
the Unusual Uses Test. These results 
show that the evocation of different 
responses to the same stimuli is an im- 
portant aspect of the standard experi- 
mental procedure, although arousal of 
many different responses by different 
stimuli is also effective, within limits. 

The significant loss of originality by 
the new control group was not entirely 
unexpected, and shows that a training 
technique can significantly decrease as 
well as increase originality, a result 
which is of theoretical interest as well 
as suggestive for classroom practices. 
Exactly why the effect should appear 
on the Unusual Uses Test and not on 
the preceding free association test is 
not clear, and is a problem that re- 
quires further experimental investiga 
tion. 

Another experiment (Maltzman et 
al., 1960) was designed to further 
explore possible relevant variables and 


alternative procedures for facilitating 
originality. 

The standard control and experi- 
mental conditions again 
ployed. Another experimental con- 
dition was given repeated presentation 
of items from the Unusual Uses Test 
in order to determine whether the ef- 
training procedure 


were em- 


fectiveness of the 
is limited to only certain kinds of ma 
This 
presenting the list of six items a total 
of six times with instructions to give a 
different use prior to each repetition. 
A different kind of training procedure 
with a third experi- 
They were presented 


terials. was accomplished by 


was employed 
mental group. 
with a booklet consisting of 125 dif- 


ferent pairs of words. The Ss were 
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indicate the member of 
each pair which goes more readily 
with the stimulus word orally pre 
sented to them. The same stimulus 
words and number of repetitions as 
employed with the standard experi- 
mental conditions administered. 
Each pair of words were unique re- 


instructed to 


were 


sponses to the stimulus words selected 
from the previously established norms. 
A fourth group 
given the word pairs and asked to 
which member of each pair 
The 
stimulus words were not presented. 


> ls 


experimental was 
indicate 
they thought to be more familiar. 


» association and Un- 
that the 
condition was 
than the 
r groups on both tests, while 


showed 
ndard experimental 


more original 


groups did not differ sig- 
ficantly from each other 
In one of the new conditions uncom 


mon responses were evoked as textual 


responses stimulus 


following the 
words. In the other 


the textual occurred. 


condition only 
These 


order to 


response 
conditions were introduced in 
whether the uncommon re 
the stimu- 
lus words in order for the training to 
be effective. The fact that these 
groups were significantly less original 
than the standard experimental group 
suggests that, at least with the training 
uncom- 


determine 
sponses must be evoked by 
1 


and test “iuaterials employed, 
evoked as in- 


mon responses must be 


traverbal responses by the stimulus 


to be an 


wor Is if the re 1s 


increase in 


originality situations. 


9 in the test 
Since the experimental 
which received original training with 
items from the Test 
failed to significant training 


effect, another experiment was 


condition 


Unusual Uses 
show a 
con- 
ducted to determine whether the failur¢ 
was due to an inadequate amount of 
training. Four conditions were 
ployed in this study receiving different 


em- 
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amounts of training. None of the 
groups showed a significant increase 
in originality on the free association 
test when compared with the standard 
control condition. 

Results from the latter experiment 
and one condition from the previous 
experiment showed that training with 
unusual uses items in the same fashion 
as word association materials does not 
produce a comparable increase in orig- 
inality. Thus the method of repeated 
evocation of different responses to the 
same stimuli does not work with all 
materials, 

A basic assumption of our approach 
is that, if original behavior can be in- 
duced to repeatedly through 
some training procedure, it will in- 
crease in its frequency of occurrence 
in new situations. If this condition 
is not met, transfer of originality to 
new situations cannot occur. Inspec- 
tion of the unusual uses training data 
indicated that an appreciable increase 
in originality did not occur during 
training. In the case of training with 
free association materials a marked in- 


occur 


crease does take place. 

\nother experiment was conducted 
in which different groups received 0, 
1, 5, or 10 repetitions of the initial 
word association list before testing on 
another word association list and the 
unusual uses. <A_ significant trend 
was evidenced on both originality tests, 
with a single repetition of the training 
list producing a significant increase in 
originality (Maltzman et al., 1960). 

On the basis of the current dearth of 
relevant information, we can only spec- 
ulate as to why training with unusual 
uses failed to induce an increase in 
the uncommonness of the responses. 
One hypothesis that may be offered is 
based upon the apparent relatively lim- 
ited number of different uses available 
for each item. Given the instructions 
to write a different use to each repeti- 
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tion of an item and the inability of Ss 
to do so with a different operant, they 
respond with essentially synonymous 
uses repeatedly, a condition, as we 
have already seen, not likely to facili- 
tate originality. 

If, as we have assumed, originality 
can be learned according to the prin- 
ciples of operant conditioning, then 
the effects of originality training 
should be of the same kind as those 
ordinarily obtained in experimental 
studies of learning. One of the gross 
characteristics of learning is that thie 
behavioral changes produced by ‘ein- 
forcement persist for some time, are 
not relatively transistory. Increz.sed 
originality induced by the training 
method previously described should 
therefore also persist. An experiment 
designed to investigate this property 
of originality training has been con- 
ducted, and positive results have been 
obtained (Maltzman et al., 1960). 
Using the standard control and experi- 
mental conditions of the previous ex- 
periments, a delay of approximately 
one hour was interposed between the 
training and test situations for a con- 
trol and an experimental group, and 
a delay of two days for another con- 
trol and experimental group. A sig- 
nificant training effect was obtained on 
the word association and the Unusual 
Uses Test of originality. 

These results show that the effects 


of originality training persist to a sig- 
nificant degree for at least two days, 
under the given experimental condi- 


tions. We may conclude from these 
results and that of the previous ex- 
periment that the standard experi- 
mental procedure for training original- 
ity produces some kind of learning. 


RELATION OF ORIGINALITY TRAINING 
TO LEARNING TO LEARN 


Originality training studies of the 
kind reviewed here and learning to 
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learn studies (Duncan, 1958; Harlow, 
1949; Morrisett & Hovland, 1959) 
have the common characteristic of 
producing relatively nonspecific trans- 
fer effects. Training on given stimuli 
transfers to other stimuli in the absence 
of any obvious opportunity for the 
operation of primary stimulus general- 
ization. Despite the apparent similar- 
ity between the results of training in 
learning to learn and those discussed 
here, differences in the experimental 
operations and the kind of behavior in- 
duced in these procedures suggest that 
different principles are involved in 
these situations. 

The learning to learn procedure in- 
volves the learning of a great many 
different discrimination problems. 
Different stimuli are presented repeat- 
edly until S has learned a differential 
response to a criterion or has received 
a given number of trials. The re- 
sponses are not varied and uncommon 
responses are not reinforced, nor can 
they be, because a specified response 
is correct. Under these conditions Ss 
show a progressive decrease in the 
number of trials required to learn new 
discrimination problems. In original- 
ity training, on the other hand, vari- 
ability of response is induced. Re- 
peated presentation of the same stimuli 
and reinforcement of a specific pre- 
designated response likewise does not 
occur in originality training. In learn 
ing to learn, the stimuli vary from 
problem to problem, but the responses 
do not. In originality training and 
test situations the stimuli and the re- 
sponses vary. Thus the two proce- 
dures differ in terms of the operations 
employed to induce nonspecific trans- 
fer. Furthermore, the interpretations 
offered by Duncan (1958) in his com- 
prehensive study of learning sets at 
the human level do not appear appli- 
cable to originality training. The ac- 
quisition of observing responses, habits 
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of looking at details of stimuli, etc. 
are important in discrimination prob- 
lems but do not appear to be particu- 
larly relevant in originality training 
situations of the kind described here. 
Likewise, the analysis of learning to 
learn by Morrisett and Hovland 
(1959) in terms of the discrimination 
between stimulus trace patterns does 
not appear applicable to originality 
training. 


\N INTERPRETATION OF ORIGINALITY 


TRAINING 


Two basic problems, not unrelated, 
posed by the experiments on original- 
ity training by Maltzman and his asso- 
ciates (1958, 1960) demand an inter- 
pretation. First is the nature or 
source of the reinforcement for origi- 
nality. In all of the experiments, suc- 
cessive training trials in the standard 
experimental conditions are accom- 


panied by increases in the uncommon- 
ness of the responses. 


The originality 
induced in this manner transfers to 
different stimulus materials, and these 
effects tend to persist. Originality 
training thus produces behavioral 
changes that are characteristic of learn- 
ing, yet they occur in the absence of 
differential reinforcement administered 
by E. Nevertheless, we believe that 
uncommon responses received differ- 
ential reinforcement. Reinforcement 
was obtained, because the occurrence 
of, any previously established intra- 
verbal association is self-reinforcing. 
\s Thorndike puts it: “The mere oc- 
currence of a connection strengthens 
it” (1949, p. 23). Thus, where there 
is an initial probability greater than 
zero that a given verbal stimulus will 
evoke a verbal response in an operant 
situation such as word association, the 
occurrence of that response will in- 
crease the probability of its occurrence 
on a subsequent presentation of that 
stimulus. Different initial conditions 
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may only increase or decrease the 
amount of reinforcement affecting the 
stimulus-response connection. In the 
type of experimental situation with 
which we are concerned here, the un- 
commonness of the response evoked by 
the stimulus is one of these conditions. 
In other words, the amount of rein- 
forcement is inversely related to the 
initial probability of a verbal stimulus 
evoking a verbal response. Originality 
is more reinforcing than commonplace 
responding. 

An impression gained from observ- 
ing Ss in the experimental situation is 
that repeated evocation of different re- 
sponses to the same stimuli becomes 
quite frustrating: Ss are disturbed by 
what quickly becomes a surprisingly 
difficult task. This disturbed behavior 
indicates that the procedure may not 
he trivial and does approximate a non- 
laboratory situation involving origi- 
nality or inventiveness, with its fre- 
quent concomitant frustration. This 
impression of frustration accompany- 
ing the task of producing repeated un- 
common responses would lend support 
to the notion that when such responses 
do occur they are self-reinforcing to a 
considerable degree. These specula- 
tions, however, are not essential to the 
investigation of the initial hypothesis 
of the self-reinforcing character of un- 
common responses. However, these 
as well as other considerations suggest 
that the distribution and scheduling of 
the evocation of uncommon responses 
may be of great importance in deter- 
mining the extent to which a trans- 
fer effect will be obtained. 

It should be noted, however, that 
the role of self-reinforcement is prob- 
ably less significant in producing an 
increase in originality during training, 
the successive repetitions of the same 
word list, than in the test situation 
Since instructions to the standard ex 
perimental group prohibits the reoc- 
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currence of the same response to a 
given stimulus word, successive re- 
sponses to that word will be progres- 
sively lower in the related response 
hierarchy. This is sufficient to account 
for the increase in the uncommonness 
of responses with successive repetitions 
of the same stimulus words. How- 
ever, we believe that a second variable 
contributes to the increase in origi- 
nality during the training session, gen- 
eralization of the reinforcement effects 
from the evoked response to still more 
uncommon responses in the same 
hierarchy. However, the role of self- 
reinforcement becomes _ significant 
when the transfer of originality to the 
test situations is considered. In this 
case the carrying out by S§ of previ- 
ously administered instructions to give 
different responses will not account 
for the greater originality of the stand- 
ard experimental condition in the test 
situations. 

This leads to the second problem 


presented by the experimental data: 
the fact that the evocation of uncom- 
mon responses on one list of stimulus 


facilitates the occurrence of 
original responses on a different list 
of words, and even more perplexing, 
on the rather different Unusual Uses 
Test. The problem is to account for 
the fact that reinforcement of one in- 
stance of original behavior increases 
the response tendency for other in- 
stances of such behavior under differ- 
ent stimulus conditions. A relatively 
nonspecific kind of transfer of training 
is-induced. Without it, the facilitation 
of originality would not be possible. 

We do not have any simple rigorous 
explanation of this phenomenon at the 
present time, but offer the following 
tentative suggestions. The intraverbal 
associations possessed by a normal 
educated adult are enormously exten- 
sive and complex. In all likelihood, 
almost every verbal response is associ- 
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ated to some extent with every other. 
Furthermore, the intraverbal associa- 
tions among common verbal responses 
are stronger than between common 
and uncommon responses. Likewise, 
the intraverbal associations among un- 
common responses are relatively 
stronger than between uncommon and 
common. The evocation and rein- 
forcement of uncommon _ responses 
will, through complex kinds of medi- 
ated generalization, therefore increase 
differentially the probability of occur- 
rence of other uncommon responses 
even though they do not ordinarily) 
occur to the same stimuli as the orig- 
inal reinforced responses. In a similar 
fashion, the evocation of common re 
sponses would differentially increase 
the probability of occurrence of other 
common responses even though they 
may be members of different response 
hierarchies. An additional character 
istic of the standard experimental pro- 
cedure used in originality training is 
that it provides for the inhibition of 
common responses as well as the fa- 
cilitation of uncommon responses. 
Through mediated generalization the 
effects of inhibition will produce a dec- 
rement in the excitatory potential of 
other common responses. It is this 
characteristic which may be responsible 
for the more general transfer effects 
obtained with the standard experi- 
mental procedure than with the proce- 
dures that induce uncommon textual 
responses or employ uncommon stimu- 
lus situations for evoking, in turn, un- 
common responses. None of these 
other procedures would involve the 
inhibition of common responses. 

It should be clear that the foregoing 
hypotheses are speculative and their 
connection with the data presented is 
tenuous. Considerable careful experi- 
mentation tracing the intraverbal as- 
sociations present in the experimental 
situation is needed as well as the de- 
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velopment of laws of verbal com- 
pounding before a completely adequate 
explanation of the basis for originality 
training can be given. Whether or not 
such explanations can be given, the 
method of training originality de- 
scribed in this paper seems worthy of 
further study in its own right, and 
potentially may have considerable prac- 
tical application. This again is a prob- 
lem requiring sustained experimental 
research employing different kinds of 
test materials. 

In conclusion, it should he empha- 
sized that little is to be gained by dis- 
puting the usage of the term “origi- 
nality” as it has been employed here, 
or whether the current experiments de- 
scribed are investigating the essence of 
originality. The important problem is 
the experimental one of determining 
the variables influencing the occur- 
rence of uncommon responses in rela- 
tively simple situations and the func- 
tional relationships that obtain in these 
situations. Finally, the extent to 
which these effects hold for more com- 
plex behavioral situations must be de- 
termined. Such work is now in prog- 
ress in the UCLA laboratory. But to 
designate only highly complex _be- 
havior of a given sort as original to 
the exclusion of simpler better con- 
trolled behaviors seems gratuitous, as 
well as an obstacle to the progress of 
research in this vital area. 


SUMMARY 


The basic problem in the training of 
originality is to devise a means of in 
creasing the frequency of uncommon 
Once it takes place it may 
receive reinforcement and increase the 
probability that other original behavior 


behavior. 


will occur. Earlier attempts to devise 
training methods for originality were 
briefly mentioned, as well as related 
studies of problem solving. 
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\ series of experiments by Maltz- 
man and his associates was reviewed, 
and a procedure which consistently 
facilitated originality was described. 
This procedure involves the repeated 
presentation of a list of stimulus words 
in a modified free association situation 
accompanied by instructions to give a 
different response to each stimulus. 
Under these conditions the responses 
become more uncommon. When pre- 
sented with new stimulus materials, Ss 
receiving such training are reliably 
more original than Ss receiving no 
training. Tentative suggestions as to 
the behavioral bases for the training 
effect were given. 
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CONDITIONING OF AUTONOMIC RESPONSES’ 
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/°xperiments on conditioning with hu 
man subjects (Ss) are complicated by 
the fact that response change or “learn- 
ng” can take place at different levels, 
ometimes facilitating and other times 
interferring with the experimenter’s 
Razran (1955) empha- 
sized this point when he distinguished 
imong five relations of conditioning and 
perception. Of his categories two are 
particularly relevant here, representing 
. distinction between ‘conditioning 
without perception” and “conditioning 
with perception,” the difference de- 
pending largely upon whether the rela- 
tion between the unconditioned reaction 
ind the to-be-conditioned 
perceived by S. 

As a specific example consider stud- 
ies conditioning the GSR or change in 
heart rate with human Ss. Over twenty 
ago Cook and Harris (1937) 
demonstrated one trial “verbal condi- 
tioning” of the GSR in which the 
strength of the response was not de- 
pendent upon the number of reinforce 
ment trials. They concluded, not only 
that the GSR is verbally conditioned, 
but also that such “verbal conditioning 
is impossible to prevent in conditioning 
by the usual procedure.” In spite of 
this early result, most GSR “‘condition- 
ing” studies since that time appear to 
have been conducted either ignoring 
verbal-perceptual factors or assuming 
that they are controlled by instructions 
designed to mislead the Ss concerning 
the nature of the experiment. 


(E’s) goals. 


reaction is 


years 


1 This paper is an outgrowth of a program 
of research supported by grants from the 
National Science Foundation 


When the “perceptual” problem has 
been attacked directly the favored ap- 
proach has been to manipulate ver- 
bal and attitudinal conditions through 
changes in instructions to §, informing 
him of stimulus relations or asking him 
to take a voluntary part in the process. 
Early studies of this type were summa- 
rized by Hilgard and Marquis (1940). 
Later studies have not emphasized 
autonomic responses, although Lacey 
and Smith (1954) and Lacey, Smith, 
and Green (1955) demonstrated heart 
rate conditioning differences between 
“aware” and “unaware’”’ Ss; and Grings 
and Kimmel (1959) noted marked in- 
struction effects in GSR discrimination 
learning. References to the evanescent 
nature of GSR conditioned responses 
are common (Grant & Schiller, 1953. 
Grings & Shmelev, 1959), and the im- 
plication is usually made that the GSR 
is sensitive to transient changes in hu- 
man Ss. 

The above is ample to suggest that 


the handling of verbal perceptual set 


factors in human autonomic condition- 
ing remains a problem of major impor- 
tance. Data available are limited to 
demonstrations of the existence of per- 
ceptual factors, with very few attempts 
to define operations for their detection, 
evaluation, and control. Extension of 
understanding of the conditioning proc- 
ess with human Ss may result from 
exploration of new operations for in- 
ferring and manipulating perceptual 
sets in situations resembling classical 
conditioning. 
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lic. 1. Average responses on disparity 
trials and nondisparity trials for groups with 
different numbers of prior reinforcements 
Tone and light stimuli. Response magnitude 
in square root of conductance change in 
micromhos. 


PERCEPTUAL DISPARITY 
RESPONSE 


THI 


The present research introduces an 
operation for inferring the existence of 


a perceptual set in conditioning studies, 
one that is not based on verbal instruc- 


tions to the S. The concept is based on 
the premise that human Ss, in situa- 
tions with conditioned stimuli preced- 
ing noxious stimulation, use the cue as 
a preparatory signal—that is, the re- 
sponse to the conditioned stimulus (or 
what is, in effect, the conditioned re- 
sponse) is partly a preparatory set for 
receipt of the unconditioned noxious 
stimulus. 

It is proposed that the strength of a 
perceptual set aspect of conditioned re- 
sponses can be estimated from per- 
ceptual disparity situations, or from 
situations in which a cue stimulus is 
followed by an unconditioned stimulus 
different from that which had been as- 
sociated with that cue previously. In 
other words, it is assumed that there 
will be a difference in magnitude of 
response between situations where re- 
ceipt of stimulation is in accord with 
past experience and where such receipt 


of stimulation is not in accord with past 
experience. It is further assumed that 
a perceptual disparity variable, so de- 
fined, will be related to meaningful de- 
termining conditions in human behavior 
and lead to explanatory possibilities 
which go beyond more conventional 
definitions of conditioned responses. 
To test the feasibility of such a dis- 
parity variable, several experiments 
have been conducted and will be de- 
scribed briefly below along with later 
studies amplifying upon the concept. 
In one experiment, three groups of six 
Ss each were taught that when they 
moved a lever in one direction they 
would receive a tone stimulus and when 
they moved the same lever in the op 
posite direction they would receive a 
bright light. The groups were given 
different numbers of training trials 
(four, six, and eight) during which the 
above regularities applied. Then they 
were administered a test series contain- 
ing both disparity and nondisparity 
trials. Disparity was achieved by fol- 
lowing with a tone the direction of 
movement which previously had been 
followed by light, or by following with 
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Fic. 2. Average responses on disparity 
trials and nondisparity trials for groups with 
different numbers of prior reinforcements 
Tone and electric shock stimuli. Response 
magnitude in square root of conductance 
hange in micromhos 
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a light the direction of movement which 
had previously been followed by a tone. 

The results from the test series are 
plotted in Figure 1 where it can be seen 
that the magnitude of response on dis- 
parity trials was larger than the re- 
sponse on the nondisparity trials for 
all groups and the difference between 
the curves increases with larger num- 
bers of training trials. The difference 
between the disparity and nondisparity 
responses tested overall by a ¢ test was 
significant at the 1% level; the effect 

the number of reinforgements var- 
iable tested as an analysis of variance 
of the subgroup mean differences be- 
tween disparity and nondisparity re- 
sponses was not significant. 

\ second, similar study (Grings & 
Kimmel, 1958) was conducted using 
tone and shock stimuli rather than tone 
and light as UCSs. Sixty Ss were ob- 
served, 30 being given 4 reinforcements 
before test and 30 being given 10 rein- 
forcements before test. Again the test 
series contained both disparity and non- 
disparity trials. The results, shown in 
Figure 2, agree with those in the first 
study, i. 
(GSR 


, for both groups the response 
on disparity trials is larger 
than on nondisparity trials (significant 
at the 1% level) and the size of the 
difference between the disparity and 
nondisparity responses increases as a 
function of the number of reinforce- 
ments. 


On | 


the basis of these and related 


studies it was felt that the assumption 


of a disparity variable was reasonable, 
and attention was turned to the manip- 
ulation of introducing 
appropriate variation to 
which it might be related 


the variable by 


independent 


DETERMINERS OF THE PERCEPTUAL 
DISPARITY RESPONSE 

On the cited 

studies it was hypothesized that at least 


lasses of determiners of the 


the basis of already 


two major c 
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One involves 
the strength and stability of the antic- 
ipatory response, which in turn appears 
to be a function of the number of rein- 
forcements of the cue-consequent rela- 
tions. If preparatory set responses in- 
crease and stabilize as most habits do, 
the more frequent the sequence cue- 
reinforcing stimulus the more strong 
and consistent should be the organism’s 
preparation for the reinforcing stim- 
ulus, other variables held constant. 
Elaboration of this point, however, 
needs to be made, for the operation 
of competing processes, like response 
adaptation, is clearly evident in Figures 
1 and 2. Further discussion of this 
matter is made later. In addition, nu- 
merous factors related to the 
tioned stimulus used (such as its in- 
tensity, discriminability, etc.) probably 
determine the strength of the condi- 
tioned preparatory response. 

The second major class of determiner 
of the magnitude of the disparity re- 
sponse would include the direction of 
the disparity—i.e., whether actual stim 
ulation was stronger or weaker than 
that for which S was prepared—and the 
amount of disparity measured on some 
scale of stimulus distance from antic- 
ipated to actual stimulation. 

Several studies have been conducted 
to explore the hypotheses outlined 
above, two examples of which are given 
below. One of these has been reported 
by Kimmel (1960). He was testing 
the relation between amount of classical 
conditioning of the GSR and the in- 
tensity of the CS. Tone CSs of inten- 
sities of 35 


disparity response exist. 


condi- 


.0002 dyne/cm? were used with an 
electric shock UCS. In attempting to 
control differences in response strength 
to cue stimuli during an extinction 
series he subdivided each acquisition 
CS intensity group into three extinc 
tion intensity subgroups (45, 75, and 
105 db Since at the time of the first 
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Fic. 3. Magnitude of perceptual disparity 

response as a function of amount and direc- 


tion of stimulus change. (Reproduced from 
Kimmel, 1960.) 


extinction trial S had just been given 
20 trials during which the first stimulus 
he received on each trial was the acqui- 
sition intensity, it may be assumed that 
hetween trials he remained prepared 
for that intensity. When, therefore, he 
received an intensity different from the 
previously experienced one, he might 
respond to the disparity. Kimmel’s 
data are presented in Figure 3 in 
terms of GSRs resulting from different 
amounts and directions of change. The 
effects of both direction and amount of 
change were reported as statistically 
significant. 

Similar results were obtained in a 
study by Grings, Dossett, and Honnard 
(1959). Forty-eight college students 
were randomly separated into two ex- 
perimental groups. One group was 
given conditioning training with 100% 
reinforcement, the other with 67% re- 
inforcement. Two cue stimuli (colored 
light CSs) were followed by a 50-db. 
tone or a tone of different intensity 
Four subgroups of 6 Ss each were 
trained to different second stimuli of 
either 30, 70, 90, or 110 db. re: .0002 
dyne/em*. <A_ standard f 
eight reinforcements of each cue-tone 
combination was followed by a dis- 
parity trial. Since each disparity trial 


number of 
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was made up of the cue to the 50-db 
stimulus followed by use actual “other” 
stimulus, there resulted four different 
degrees of discrepancy: —20 db., 20 
db., 40 db., and 60 db. 

The results of this experiment are 
presented in Figure 4 where mean re- 
sponse to the UCR component of the 
disparity trial (adjusted for reference 
level of the UCS involved) is given as 
a function of amount and direction of 
disparity. When an analysis of var- 
iance was made on the data shown, the 
difference between the 100% and 67“ 
groups was found to be significant bx 
yond the 5% level (F = 6.80, df = 2 
and 6). The difference between the 
disparity subgroups, as based on the 
values portrayed by the combined 
groups curve in the figure, fell just 
short of the 5% level of significance 
(F = 2.25, df = 4 and 6). 

On the basis of the above studies. it 
is tentatively concluded that the greater 
the amount of discrepancy between 
anticipated and actual stimulation the 


100% 


sd COMBINED 
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Fic. 4. Mean responses to the UCR com 
ponent of the disparity trial, adjusted for 
reference level of the UCS involved, for 
groups differing in the percentage of prior 
trials reinforced. 
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Fic. 5. Comparison of conditioned heart 
rate responses for groups differing in aware- 
ness of CS—UCS relations. (Adapted from 
Lacey, Smith, & Green, 1955.) 

greater the GSR elicited by the actual 
stimulus. Discrepancies in an upward 
direction appear to yield larger dis- 
parity responses than discrepancies in 


a downward direction. In addition, 


these factors appear to be a function of 


the number of reinforcements and the 
percentage of reinforcements employed. 


APPLICATIONS OF THE DISPARITY 
CONCEPT 


Thus far the emphasis has been upon 
the definition of the disparity response 
and the exploration of some of its de- 
termisiing conditions. Next, attention 
is paid to application of the disparity 
situation in attempts to clarify some 
problems of human conditioning. 

One of the problems that has con- 
cerned the writer is the nature of the 
GSR acquisition “learning” curve. In 
other words, what is the effect of the 
number of trials (or reinforcements) 
on the magnitude of a conditioned au- 
tonomic response? If the conditioned 
GSR, for example, is determined by 
interactions of different processes (i.e., 
nonperceptual and perceptual ), it would 
be predicted that at times one process 
or the other might be dominant. One 


result could be that a “typical” learning 
curve would not be obtained: that is, 
GSR magnitudes as a function of num- 
ber of reinforcements would not be an 
increase in CR magnitude with.an in- 
crease in trials. 

That different “learning” curves 
might be expected from different per- 
ceptual situations has been demon- 
strated by Lacey, Smith, and Green 
(1955). It will be noted from Figure 
5 that they found “unaware” Ss gave 
increasing heart rate CRs with more 
trials, whereas “aware” Ss showed de- 
creasing CRs with more trials. Sim- 
ilarly, Kimmel (1959), with GSR con- 
ditioning in this laboratory, found 
magnitude of anticipatory CRs to in- 
crease up to eight reinforcements, then 
to decrease. His data are given in 
Figure 6. The implication in both 
cases is not that less “learning” is 
occurring with more reinforcements but 
rather that the perceptual aspects of the 
situation are probably changing. 

If it might be presumed that per- 
ceptual set factors are stabilized by re- 
peated trials where a given CS-UCS 
relation obtains, it might be proposed 
that as long as no irregularities occur 
in the training situation S will cor- 
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rectly ‘“‘set”’ for the noxious UCS and 
therefore will show a curve of adapta- 
tion with successive stimulation (just 
as he would to a series of presentations 
of the UCS alone). The result will be 
that anticipatory CRs and UCRs may 
both show reduced amplitude with in- 
creased trials. 

It might be argued further that it is 
the perceptual tendencies or sets which 
are becoming stronger with increased 
experience or reinforcements. If the 
disparity response is taken as evidence 
of perceptual set strength, it can be 
predicted from data already at hand 
(Figures 1 and 2) that the disparity 
variable (the difference between the 
response magnitudes on disparity and 
nondisparity trials) does increase with 
larger numbers of reinforcements. It 
was also found (Figure 4) that the dis- 
parity variable was greater for 100% 
than for 67% reinforcement conditions. 

The proposal here, then, is that the 


disparity response may show a differ- 
ent course of development (with regard 


to the number-of-reinforcements var- 
iable) than will a simple anticipatory 
CR. Depending upon conditions it 
may provide a “better” estimate of 
strength of preparatory set than the 
anticipatory CR does. In order to 
examine this possibility, consider a 
current experiment in which an attempt 
was made to emphasize the perceptual 
set properties of the conditioning envi- 
ronment, 

Taking a lead from Cornbecker, 
Welch, ‘and Fisichelli (1949) the ex- 
periment was designed to test the no- 
tion that reinforcing a class of cues 
intrinsically suggesting stimulation will 
lead to increased perceptual response 
tendency (to expect stimulation) on a 
cue not reinforced but capable of sug- 
gesting stimulation. The hypothesis 
was tested by presenting a series of 
seven-word stimuli which were appro- 
priately reinforced during the acquisi- 


tion period, i.e., the words “soft music” 
were followed by a few soft 
orchestral music, the words “cool 
breeze” were followed by a brief stim- 
ulation by an electric fan, “loud tone” 
followed by actual tone, “green light”’ 
by actual green light, etc. After either 
14 or 49 such “reinforcements” a crit- 
ical word or neutral word was intro- 
duced in a test series which also in- 
cluded a disparity trial (cue “green 
light” followed by the actual loud 
tone). 

Since in all cases the cue was pre- 
sented for 5 seconds before the rein- 
forcing (UCS) stimulus was presented, 
it became possible to trace the course 
of anticipatory responses following any 
particular cue. Also, since long and 
short reinforcement were em- 
ployed, the effect of different numbers 
of reinforcements on both the regular 
and the disparity trials could be ob- 
served. The previous development 
would lead to the prediction that antic- 
ipatory responding (to the cue “loud 
tone”) would become less with more 
reinforcements, as would also the 
response to the actual tone, due to 
adaptation. 


bars of 


series 


On the other hand, the 
disparity response to tone following an 
other (“green light”) cue would in- 
crease with greater numbers of rein- 
forcements. 

The results for this comparison for 
20 Ss receiving 14 reinforcements prior 
to the disparity trial and 20 Ss receiv- 
ing 49 reinforcements prior to the dis- 
parity trial are shown in Figures 7 
and 8. The data follow the prediction 
closely, the anticipatory responses show- 
ing significant decrease from short to 
long reinforcement series (t = 2.64, df 
= 38) and the disparity responses 
showing a significant increase over the 
same period (t = 2.23, df = 38).? 

2 Essentially the same relations were ob 


tained with changes in digital blood flow as 
the response variable. 
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Explanation of the above results has 
proceeded from familiar behavioral pos- 
tulates concerning reinforcement and 
perceptual sets. The concept of rein- 
forcement designated an empirical state 
of affairs with specified temporal reg- 
ularities obtaining between classes of 
stimuli (CSs followed by UCSs). 
Similarly, the perceptual set concept 
was related to such general notions 
as orientative-investigative reactions of 
Pavlov (1927), anticipatory goal re- 
sponses of Hull (1930), and the S-R 
form of Tolman’s expectancy (Mac- 
Corquodale & Meehl, 1954). It was 
later pointed out to the writer that 
experimentation on disparity variables 
using different descriptive terms had 
been carried out in the laboratory of 
Anokhin in Russia. This led to fur- 
ther considerations, as follows. 

Anokhin (1955) was interested in a 
theory of adaptive behavior 

Id | to integrate the ac- 


1 
uid herp 


general 
which 

physiologists and psychol- 
s central notions is 
that of “reverse afferentation,” a proc- 
ess of reverse signalization or feedback 
which follo reflex act “like an 
echo” and which informs the organism 
whether the adaptive effect appropriate 


tivities of 
ogists One of hi 
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ws each 
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Fic. 7. Anticipatory CRs (magnitude of 
GSR during delay interval) as a function of 
the number of reinforced trials. 
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Fic. 8. Magnitudes of disparity response 
(difference between responses to the UCS on 
disparity and nondisparity trials) as a func- 
tion of the number of reinforced trials. 


to the situation has been achieved or 
not. This afferent process provides the 
necessary conditions to the organism’s 
stopping activity or passing on to other 
acts comprising the chain of adaptation. 
The notion of “reinforcement” can be 
applied to any fractional stage of adap- 
tation which has obtained correspond- 
ing reverse afferentation. He talks 
about “sanctioning afferentation” which 
stops further attempts at organization 
of new reflex acts and strengthens the 
combination of excitation in the 
centers of the brain which gave a suc- 
cessful adaptive effect at the periphery. 

On the question of “How does the 
organism tell the sanctioning properties 
of the afferentation?” Anokhin brings 
in the concept of “intention.” To be 
effective reverse afferentation must cor- 
respond to our intention, that is, it must 
correspond to some provisional com- 
plex of excitations which arose before 
the reflex itself was formed. At the 
time of the arrival of the reverse af- 
ferent impulses from the UCS to the 
cortex the behavior will be stable only 
to the extent that the UCS excitation 
corresponds accurately to the prepara- 
tory afferent excitation which was 


last 





WILLIAM 


TABLE 1 


CoMPARISON OF GSR RECOVERY QUOTIENTS 
ro Various UCSs BEFrore AND AFTER 
DISPARITY EXPERIENCE (N = 18) 


Before Disparity 
Experience 
\fter Disparity 
Experience 
' 


i ratio 


P 


brought to life by the CS several sec- 
before the reinforcement. Put 
another way, the afferent apparatus 
produces a conclusive evaluation of the 
adequacy or inadequacy of the rein- 
forcement signal. 
The physiological basis for this process 


onds 
which follows the 


is conceptualized as a cortical “ac- 
ceptor” apparatus. 


An experiment cited by Anokhin em- 
phasizes the closeness of his work to 
that being reported from the USC lab- 
Two secretory CRs were es- 


oratory. 
tablished, one a tone “la” with rein- 
forcement with rye rusks on the right 
side of the bench and the other a tone 
“fa” with reinforcement with rye rusks 
on the left side. During the acquisition 
phase no erronecus motor reactions 
were made—the animal went right after 
“la” and left after “fa.” Then, on a 
test day dry meat was placed with the 
rusk on the left side. The result was a 
brief investigative reaction and rapid 
ingestion of the 


“é 


meat, after which the 
previous motor patterns became dis- 
rupted. 
given the animal went to the left, sug- 
gesting that the eating of the meat led 
to a dominance of the reaction to the 
left. Further observation with rusk- 
only reinforcements showed refusal to 
eat the rusk and marked changes in 
the animal’s behavior, suggesting what 


Regardless of which CS was 
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Anokhin neurotic 
state.” 

The emphasis in Anokhin’s experi- 
ment would appear to be on the disrup- 
tive effect of disparity situations rather 
than the basic relation between simple 
reinforcement concepts and the concept 
of sanctioning reverse afferentation. 
However, if one considers the matter 
closely, there are numerous implications 
for extending the perceptual disparity 
research described in this paper At 
least one example will be considered. 
From Anokhin’s experiment one can 
assume that disparity situations pro- 
duce marked change in the course of 
the “reinforcement” process. In_be- 
havioral terms the occurrence of a dis- 
parity trial would alert the organism, 
lead him to expect a change in rein- 
forcement relations. It could be argued 
that even the absence of the UCS, as in 
an acquisition test trial, may create 
The result 
would be some effect upon the acquisi- 


terms a “clear-cut 


a disparity-like situation. 


tion process. 

like the above can be 
explored by further analyzing data 
from experiments already described. 
For example, in the study discussed in 
connection with Figure 1, the disparity 
test situation contained two series, each 
with disparity and nondisparity trials. 
If the occurrence of one disparity trial 
has a disruptive effect, there should be 
a difference in responses to UCSs on 
nondisparity trials after a disparity ex- 
perience and responses to the same 
UCSs prior to disparity experience. A 
response variable which shows this type 


Possibilities 


of disruptive effect readily is the re- 
quotient first introduced by 
Freeman (1948). It is an expression 
of the percentage return of the skin 
conductance to its original level oc- 
curring within a standard latency inter- 
val. If the organism’s behavior is dis- 
rupted, the return to base level should 
be slower than when the trials are 


covery 
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proceeding smoothly. Expressed in 
terms of this experimental situation one 
would predict a slower recovery from 
response to the usual UCS after a dis- 
parity experience than before. That this 
is the case can be seen from Table 1. 

A similar effect might be expected 
with magnitude of response measures, 
that is, a larger GSR would be pre- 
dicted to a given UCS after a disparity 
trial than In the experiment 
referred this did occur, re- 
sponse magnitude changing from 1.4 


before. 
to above 


micromhos before the disparity trial to 
1.7 micromhos after. A related predic- 
tion that could be tested readily from 
numerous published studies is that, if 
an interspersed acquisition test trial has 
disparity properties (as it is proposed 
here it may have), the response to the 
first UCS after the test trial should be 
different from the response to the last 
UCS prior to the test trial. With GSR 
the magnitudes should be as indicated, 
i.e., larger UCR after disparity. 

These later considerations were in- 
nly to suggest some of the 
extended applications the disparity con- 
cept may have for explaining certain 
classical conditioning results. Other re- 
studied currently in 
order to evaluate the utility of the con- 
These extend from attempts to 
incorporate disparity notions in a ra- 

of GSR behavior (Dossett 
& Grings, 1959) to the use of the per- 
lisparity response as a measure 


troduced 


lations are beings 


ng 
cept 
tional model 


ce ptual ( 


f individual differences. 
SUMMARY 


\ set of 


existence of a preparatory set variable 


operations for inferring the 


in human autonomic conditioning has 


been introduced and described. Termed 
a per eptu 1] disparity res ponse it is de- 
fined as the difference in magnitude of 
response between situations where re- 


ceipt of stimulation following a signal 
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cue is in accord with past experience 
and where receipt of stimulation is not 
in accord with past experience with the 
particular cue. Experimental data dem- 
onstrating the disparity variable are 
presented. 

The assumption that the perceptual 
disparity response would be related to 
meaningful determining conditions in 
human behavior was evaluated. It was 
found that the disparity response: mag- 
nitude was a function of the number 
and proportion of reinforcement trials 
and of the amount and direction of the 
stimulus change involved. 

In an attempt to extend the applica- 
tion of the disparity concept in human 
conditioning, two further situations 
were discussed. One concerned the re- 
lation between magnitude of disparity 
response as a function of number of 
training trials, and compared this to the 
common learning function relating an- 
ticipatory CRs and number of training 
trials. The second situation evaluated 
the disruptive effect of a disparity ex- 
perience in a sequence of regular rein- 
forcement trials. 

Brief reference was made to similar 
concepts and experimental work being 
time in 


carried out at the present 


Russia. 
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THE PLEASURES OF SENSATION’ 


CARL PFAFFMANN 


Brown University 


That the senses are the channels of 
communication between the external 
environment and the nervous system 
is so much a truism that, except for 
those who espouse ESP, the senses are 
held to be the sole mediators of trans- 
actions between the organism and its 
environment. But students of the sen- 
ses tend to emphasize primarily the 
nature of their information handling 
function, their discriminative capacities 
and limits as well as the physiological 
processes upon which they are based. 
Behavior theorists have given heed to 
such function in another context when 
they speak of the cue function of stim- 
uli. According to these conceptions 
discriminative stimuli serve to steer be- 
havior, to guide it or set the stage for 
the response. But in this paper I do 
not plan to elaborate upon such proc- 
Rather I wish to call attention 
to certain other aspects of sensory func- 
tion, 

In recent years, neurophysiologists 
have emphasized that sensory systems 
may mediate important functions other 
than discrimination. Attention has been 
focused on such generalized effects as 
arousal or alerting of the nervous sys- 
Bremer (1935) first 
showed that a state of cerebral somno- 
lence could be produced in experimental 
animals in which the cerebrum had been 
surgically isolated from the brainstem 
and its incoming afferent pathways. He 
attributed this to the interruption of 
influx itself. Later analysis 
(Magoun, 1958) showed that the som- 


esses. 


tem as a whole. 


sensory 


1 Slightly revised version of Presidential 
Address presented at the Annual Meeting of 
the Eastern Psychological Association, At- 
lantic City, April 13, 1959. 


nolence resulted largely from interfer- 
ence with the centripetal influence ex- 
erted by the reticular system, the 
ascending reticular activating or arousal 
system. Much subsequent research has 
shown that this system controls not 
only the state of arousal of the “higher 
centers” of the nervous system, but that 
it can also modulate and influence ac- 
tivity within the classical sensory path- 
ways themselves (Galambos, 1956; 
Hagbarth & Kerr, 1954; Hernandez- 
Peon, 1955). The reticular system 
may also be significant for a variety of 
other psychological processes (Linds- 
ley, 1951; Schlosberg, 1954). 

Most sense organs were found to 
have at least two central neural path- 
ways. One, the well-known primary 
projection pathway via the thalamic 
sensory relays to the cerebral cortex, 
and the second, a nonspecific pathway 
by way of the reticular activating sys- 
tem with diffuse projections to the 
cortex and other neural structures. The 
reticular system is multimodal and in- 
tramodal, for it receives inputs from 
many modalities. The classical projec- 
tion systems are specific to one modality 
and may be said to mediate primarily 
the cognitive or discriminatory sensory 
functions, the nonspecific system to 
mediate physiological and behavioral 
arousal (Magoun, 1958). 

Psychologists especially have empha- 
sized that stimuli may have still other 
functions, especially that of reinforce- 
ment (Hebb, 1958b; Keller & Schoen- 
feld, 1950; Skinner, 1953; Spence, 
1956). This concept has most often 
been discussed in purely behavioral 
terms until the more recent studies 
of direct reinforcement by intracranial 
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stimulatto:rt (Brady, 1958; Miller, 1957 ; 
Olds, 1958; Olds & Milner, 1954). In 
the ensuing discussion I shall limit my- 
self to the so-called primary reinforce- 
ment for there is almost no limit to the 
range of previously neutral stimuli that, 
by one method or another, can be made 
to acquire reinforcing properties. That 
primary aversive reinforcement has been 
a function of stimuli has long been 
known, particularly for those stimuli of 
high intensity which elicit defense re- 
actions or. “reflexes.” Certain of these 
seem qualitatively more prepotent as, 
for example, the pain of electric shock ; 
“rat runners” have frequently used 
shock as a primary negative reinforcer 
in theif mazes or lever boxes. Some 


theorists hold that the pain reduction 
by shock termination provides the pro- 
totype of all reinforcement in the form 
( Miller, 
The study of stimuli as posi- 
tive primary reinforcers has, until re- 


of drive or tension reduction 
1959). 


cently, been restricted to those stimuli 
which are naturally related to such bio- 
logically functional activities as eating, 
drinking, or sexual activity. These 
stimuli could often be assigned a role 
in processes that mediated the satisfac- 
tion of a need or “homeostatic drive.” 
Thus they were said to be related in 
some to the primary biological 
drives. But there has been increasing 
evidence of late that sensory stirnula- 


way 


tion, divorced from its need or drive 
reducing concomitants, may function as 
a reinforcer in its own right. “‘Extero- 
ceptive motivation” sometimes called by 
such names as curiosity or stimulus 
change (Harlow, 1953; Hebb, 1958a; 
Kling, Horowitz, & Delhagen, 1956; 
Montgomery, 1952) has been demon- 
strated in a number of situations. And 
one theorist (Hebb, 1958a, pp. 451- 
467, 1958b) has attempted to link such 
reinforcement to the reticular activating 
system on the grounds that changes in 
level of activation are reinforcing, per 
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se, depending upon the prevailing level 
at the time of stimulation. Thus, in 
sensory isolation experiments, “the sub- 
ject experienced great swings of moti 
vation, which alternated between pe- 
riods of apathy and intense desire to get 
back to a normal environment.” Hebb 
concludes, “clearly man’s motivation is 
a function of his exteroceptive stimula- 
tion.” Thus, although the reinforcing 
function of stimuli has long been recog- 
nized and is the stock in trade of many 
experimentalists, there is still consider- 
able debate as to the basis for this 
effect. Does it depend upon some sec- 
ondary effect as need reduction, drive 
reduction, or change in arousal patterns 
in the reticular activating system? Or 
can we attribute these effects to sensory 
stimulation directly ? 

In this paper I shail discuss the prop- 
osition that sensory stimulation per se 
together with its ensuing central neural 
events be considered as a prime deter- 
minant of reinforcement. In this con- 
text I shall discuss our experiments on 
the sense of taste with emphasis on the 
sensory physiology of taste mediated 
motivation as revealed in the correla- 
tion between gustatory nerve discharges 
and food preferences in animals. This 
will lead into a consideration of the re- 
lation between hedonic processes and 
afferent nerve discharges, preference 
behavior, and taste reinforcement. Fi- 
nally the relation between affective sen- 
sory processes and discriminative func- 
tions will be discussed in the light of 
some speculations on the physiological 
bases for these two aspects of sensory 
function and their significance in be- 
havior. 


TASTE AS A MODEL SYSTEM 


Although I began my studies of the 
sense of taste with a traditional sensory 
emphasis, it became increasingly ap- 
parent that the gustatory sense has cer- 
tain unique features for the further 
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understanding of behavior in general. 
Not only does the sense of taste possess 
selective receptor sensitivity that per- 
mits the discrimination of different 
taste stimuli, but these stimuli will elicit 
a number of specific consummatory re- 
drinking and eating or, on the 
other hand, rejection. It is a relatively 
simple matter to demonstrate the con- 
trol of behavior that taste stimuli can 
manifest by 


sponses : 


giving the animal a choice 
of water and taste solutions as Richter 
(1942) showed in his classic studies of 
lf-selection. In ad 


uli cat 


these stim- 
as powerful reinforcers of 


Se lition, 
act 
instrumental responses leading to inges- 
tion. Thus one and the same sense 
modality possesses easily demonstrable 
i func- 


ideal 


discriminative and_ reinforcing 


In 


sensory 


tions this respect it is an 


mode! behavioral .system in 


which to examine the relation between 
sensory stimulation and the reinforcing 
as to 
state 
ism by ceftain deprivation 


mechanisms in behavior as well 


permit the manipulation of the 
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shows a typical asynchronous, multifiber discharge from a large number of nerve 
shows how such activity appears when processed through the 

1 from the American Journal of Clinical Nutrition.) 
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I should, therefore, like to 
review briefly some of the methods we 
have employed and some of our experi- 
ments on taste,? to show how we look 
at the sensory continuum and its af- 
ferent neural input to the CNS which 
both motivates and directs behavior. 
In so doing I shall make use of the 
behavioral data of other workers, as 
well as the results of our own phys- 
iological and behavioral experiments. 
Our behavioral studies to date have 
utilized primarily the preferential inges- 
tion method. 


operat ns. 


The Gustatory Afferent Discharge 
and Preference 


Afferent nerves, like all nerves of the 
body, carry a series of electrical pulses 
which are the signs of impulse traffic 
up the sensory nerve. These can be 

2 These experiments have been supported 
in part by projects and grants from the 
Office of Naval Research, National Science 
Foundation, and the General Foods Corpo- 
ration 
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Fic. 2. Curves of taste responses in the 
cat to four different taste stimuli as indicated 
by the integrated response method. (Repro- 
duced from the Journal of Neurophysiology.) 


recorded by appropriate electronic de- 
vices as shown in Figure 1 (cf. Pfaff- 
mann, 1959). I shall present only data 
for the over-all activity in the taste 
nerve as afforded by the integrating 
circuit illustrated in the lower trace 
in Figure 1. The gustatory response 
curves for the cat are shown in Figure 
2 in terms of the magnitude of the elec- 
trical signal generated in the nerve 
when each of the basic taste stimuli is 
applied to the tongue receptor surface. 
By such curves we can map out the 
taste sensitivity of different animals 
(Pfaffmann, 1955). These curves do 
not tell us about “qualitative” differ- 
ences in the nerve response. For that, 
analysis of the activity of single nerve 
fibers in the nerve is required (Pfaff- 
mann, 1959). 

From the point of view of behavior, 
the basic response to taste solutions is 
either one of acceptance or rejection so 
that the classical manifold of four tastes, 
salt, sour, bitter, and sweet, may be 
reduced to two behavioral classes’: ac- 
ceptance and rejection. We have used 
a typical two-bottle Richter-type pref- 
erence situation to study such behavior 
in different species (Bare, 1949; Car- 


penter, 1956; Pfaffmann, 1957). Cer- 
tain substances may be accepted at low, 
but rejected at the higher concentra- 
tions. This is so for NaCl in the rat 
where the well-known preference-aver- 
sion response can be demonstrated to 
be a function of concentration as shown 
in Figure 3. The solid line shows 
relative preference behavior 


cc. intake taste sol 


ce. intake taste + cc. intake H O 


Most acceptable solutions including 
sugar will follow this pattern when 
ingestion measures are used. Figure 3 
also shows the neural response in the 
rat chorda tympani nerve to the same 
salt solutions. The broken line shows 
the integrated neural responses as a 
percentage of the maximum response 
observed. Clearly the preference be- 
havior (i.¢., response greater than 
50%) is not apparent until some sub- 
stantial neural input has been achieved 
(20%-30% of maximal discharge). 
The peak preference occurs at about 
60% of maximal input, and the aver- 
sion rapidly sets in while the afferent 
discharge is still climbing. There is no 
change in the afferent input signal that 
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Fic. 3. Preference response to different 
salt concentrations and magnitude of the 
chorda tympani nerve discharge in the rat 
Ordinate to the left corresponds to prefer- 
ence response, ordinate to the right shows 
magnitude of integrator deflection. (Repro- 
duced from Pfaffmann, 1958.) 
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corresponds to the behavioral inversion 
at least we have not been able to 
detect any such change. Up to the in- 
version point, the behavior parallels the 
afferent discharge curve as if the posi- 
tive approach to the stimulus were due 
directly to the afferent input. The in- 
version and the subsequent aversion 
suggest the intrusion of a secondary 
“stop” system. The fact that the in- 
version point is close to the isotonic 
point for sodium chloride solutions 
(.9% NaCl [.15M] is isosmotic with 
the body fluids) suggests that some 
postingestion or metabolic factor re- 
lated to water balance become 
operative. 

Indeed Stellar, Hyman, and Samet 
(1954) have shown that the salinity of 
the gastric contents does affect the salt 
preference. Strong salt solutions in- 
tubated directly into the stomach de- 
press the drinking of the more concen- 
trated salt solutions. On the other 
hand, this is not the only stop factor 
for, in the same study, animals with 
an esophageal fistula showed the pref- 
erence for hypotonic solutions and aver- 
sions for hypertonic ones.even though 
the solutions never reached the stomach. 
Taste factors alone appear capable of 
eliciting the typical salt preference- 
aversion function. Thus the stop may 
indicate a change in “sign” of the af- 
ferent input merely as a consequence 
of its increasing intensity. Perhaps 
there is a central-neural switching when 
the intensity of the afferent salt dis- 
charge reaches a critical value. Most 
intense stimuli have aversive 
effects. 

The relation of behavior to sensory 
afferent discharge for some of the other 
basic taste stimuli is shown in Figure 4. 
The upper figure shows the preference 
response curves for sugar, acid, and 
quinine as well as salt. We show here 
only the intakes above 50% for salt and 
sugar, remembering that at higher con- 
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Fic. 4. Composite graph of behavioral 
and electrophysiological responses in the rat. 
Upper graphs show the percentage preference 
(or aversion) as a function of stimulus con- 
centration, The integrator responses in the 
chorda tympani nerve to the same stimuli are 
shown in arbitrary units in the lower graph. 


centrations, the curves fall below the 
50% value. The responses for acid 
and quinine are both aversions; no 
preference is shown. Note in the lower 
curves that the absolute magnitudes of 
neural response are quite different for 
the electrolytes as compared with the 
nonelectrolvtes. The behavioral re- 
sponse to quinine is quite definite and 
appears before there is a clear signal 
“greater than the noise” for the nerve 
response. Here the behavioral indi- 
cator is the more sensitive. In an 
analogous way, the response to sucrose 
behaviorally is clear and definite, yet 
the neural response is disappointingly 
small. Intrinsic differences in the mag- 
nitude of the neural response, due in 
part to fiber size, etc., mean that sheer 
size of electrical signal itself cannot be 
correlated with behavioral effect. On 
the other hand, the range of effective 


stimulus concentration values in the 
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for each 
stimulus shows good agreement. The 
exact form or position of the preference 
aversion curves along the abscissa may 
be shifted depending upon such factors 
as order of presentation of stimuli, de- 
gree of hunger or physiological need, 


two sets of determinations 


and amount of experience with the taste 
stimuli. It is the general nature of the 
relations between sensory input and 
preference that is shown in Figure 4. 
The behavioral response to sugar 
solutions has been studied rather exten- 
sively, and it now seems possible to give 
an account of the start and stop factors 
in relation to sensory input. The pref- 
erence curve just shown is based on 
ingestion of sugar over a 24-hour or 
longer period. Ingestion is a rising 
function of concentration to a peak pref- 
erence when the typical inflection oc- 
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Fic. 5. Composite graph showing the 
neural response to sucrose and the behavioral 
measures obtained by three different proce- 
dures in the rat. Pref. is the typical two- 
bottle 24-hour ingestion preference method. 
30th the sucrose and water intakes are 
shown as smoothed curves, based on data by 
Richter and Campbell. RR is the rate of bar 
pressing during a test period for a small drop 
of sucrose on a regular reinforcement sched- 
ule, and PR is the same on a periodic rein- 
forcement schedule, both based on Guttman 
(1953). Ordinates have been adjusted to 
make the maximum values in the original 
curves equal to 100. 
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curs and intake declines. This relation 
is found also in the single stimulus 
method. McCleary (1953) was one of 
the first to show that the intake of sugar 
solution is limited by an osmotic post- 
ingestion factor. Others have shown 
similar effects of gastric factors by 
studying the effects of intubation upon 
intake (LeMagnen, 1955; Miller, 1957; 
Smith & Duffy, 1957; Stellar, Hyman, 
& Samet, 1954). The relative fre- 
quency of choice in the brief exposure 
preference method, however, does not 
show the inflection (Young & Greene, 
1953). Here there appears to be a 
linear relation between the level of ac- 
ceptability and logarithm of concentra- 
tion. Guttman’s (1953) study of the 
rate of bar pressing as a function of 
concentration of sucrose solution used 
as the reinforcement showed that, on a 
continuous schedule, rate increased for 
the weaker concentrations but showed 
an inversion at the higher value. Ona 


periodic schedule, however, the rate of 
bar press was found to be linear with 


the log of the concentration. The latter 
schedule provides relatively little drink- 
ing per response. But bar press rate 
also can be depressed by intragastric 
injections (Smith & Duffy, 1955). 
The relations between bar pressing 
on the two schedules and the preference 
ingestion data of Richter and Campbell 
(1940) are shown in Figure 5 together 
with the electrophysiological response 
curve for the rat (Hagstrom & Pfaff- 
mann, 1959). Note that the inversion 
point in the two-bottle preference test 
is close to the inversion point of the con- 
tinuous reinforcement schedule. Both 
points lie close to the top of the electro- 
physiological sensory function. On the 
periodic schedule, rate rises as the sen- 
sory function increases. The fact that 
there is no inversion where the amount 
ingested is small appears to implicate 
the postingestion factor as a primary 


stop mechanism (Collier & Siskel, 
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Fic. 6. Integrator response magnitudes to 
three sugars for the rat chorda tympani 
nerve. Ordinate is relative magnitude, 100 
equals the response to 0.01 M NaCl. (Re- 
produced from Journal of Comparative and 
Physiological Psychology.) 


1959; Shuford, 1959). Such a formu- 
lation would agree with Young’s (1959) 
statement “that sugar solutions are bas- 


ically hedonically positive for the rat as 


compared with salt which is first posi- 


tive but then negative.” 

Studies which compared the effect of 
two different sugars, glucose and su- 
crose, on consummatory behavior, choice 
response, and rate of bar press likewise 
point to the importance of postinges- 
tive factors. Isohedonic concentrations, 
equally accepted in short comparative 

in equal 
singly for 


tests, were not consumed 
amount 
longer periods. 
consumed in 


isohedonically 


when presented 
The glucose solutions 
than 


sucrose 


were less volume 
their matched 
solutions, and the cumulative mean in- 
take of both sugars was linearly related 
to osmotic pressure for the longer pe- 
riods (Shuford, 1959). In the Skinner 
box the concentration of these two sug- 
ars which give equal rate of response 
on an aperiodic schedule (i.e., have 
equal reinforcing value) corresponds to 
the isohedonic concentrations (Shuford, 
1959; Young, 1957). Guttman (1954) 
previously showed that the equal re- 
inforcing solutions correspond to the 
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equally sweet concentrations found in 
psychophysical experiments (Cameron, 
1947; MacLeod, 1952). 

Hagstrom and I (1959) have also 
compared the relative efficacy of sucrose 
and glucose as gustatory stimuli for the 
rat, using the electrophysiological re- 
cording method. This could be done 
even though the magnitude of response 
to sugar is relatively small. Figure 6 
shows the relative responses to the two 
sugars, and Figure 7 shows the com- 
parison of the equi-effective concentra- 
tions of the two sugars as determined in 
the bar pressing experiment, in the 
preference choice test, and by the elec- 
trophysiological measure. All measures 
agree that sucrose is more effective than 
glucose of equimolar concentration, al- 
though the quantitative relation is not 
precise, for the electrophysiological data 
is curvilinear, the behavioral data, lin- 
ear. However, it should be remem- 
bered that the physiological data re- 
flect only the chorda tympani response 
and not the taste receptors at the back 
of the mouth. Perhaps the curvilinear 
relation between the behavioral and 
physiological measures arises from this. 

Where ingestion is minimal, behavior 


o EQUAL NEURAL RESPONSE 
o—+ EQUAL BAR PRESS RATE 
+ EQUI SWEET 
1$OHEDONIC 


% SUCROSE 


10 20 
% GLUCOSE 


Fic. 7. Equal response concentrations of 
glucose and sucrose for electrophysiological 
measures, equal bar press rates, equi-sweet 
solutions and isohedonic solutions. (Repro- 
duced from Journal of Comparative and 
Physiological Psychology.) 
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and sensory effectiveness seem to go 
together. Actually the postingestion 
factor seems to be especially significant 
in preferences based on ingestion. The 
glucose preference threshold is lower 
than that for sucrose and the intake of 
glucose greater than that predicted 
by the sensory measures. Soulairac 
(1947) has shown that the relative 
preference for sugars in a typical two- 
bottle preference test is correlated with 
their relative rates of absorption from 
the gastrointestinal tract. 

Experiments utilizing other learning 
situations have been concerned with 
the action of sugar or saccharin as 
reinforcers (Collier & Siskel, 1959; 
Hughes, 1957 ; Sheffield & Roby, 1950; 
Sheffield, Roby, & Campbell, 1954; 
Smith & Duffy, 1957; Young & Shu- 
ford, 1954, 1955). There is ample 
evidence that the taste is re- 
warding whether it is nutritive or non- 
nutritive. Performance here too is an 


sweet 


increasing function of stimulus con- 
centration where conditions appear to 
maximize the sensory effects and min- 


imize the postingestion factor. When 
amount or temporal factor is such that 
the absolute amount or volume of stim- 
ulus or nutrient ingested per unit time 
is low, then purely stimulus factors 
seem best to account for the results. 
When the amount consumed per unit is 
large, either because of a large amount 
of reinforcing agent or short time be- 
tween presentations, then the secondary 
postingestive factors intrude. As Col- 
lier and Siskel (1959) point out, it 
should be possible with the proper com- 
bination of concentration, volume, inter- 
val between reinforcement, and number 
of reinforcements to obtain a monotonic 
stimulus function, a nonmonotonic func- 
tion with inversion points at different 
volumes or concentrations, or even no 
relation at all between amount of rein- 
forcement and performance. 

These considerations presumably ap- 
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ply to Campbell’s (1958) measure- 
ments of the JNR, the just noticeable 
reinforcing difference for sucrose solu- 
tions. He determined the minimal con- 
centration increment necessary to pro- 
duce a 75% preference for the stronger 
of a pair of sucrose solutions in a two- 
bottle preference situation. Stronger 
solutions required a larger concentra- 
tion increment than did the weak ones, 
and the plot of JNR against stimulus 
concentration showed a U-shaped func- 
tion reminiscent of the more familiar 
Weber intensity discrimination func- 
tions. The resemblance may be due 
only in part to the properties of the 
sensory input because, as we have just 
shown, preference is the composite of 
sensory as well as postingestion factors. 
The osmotic effects of the stronger 
sugar solutions might be particularly 
strong. This situation contrasts with 
the JNR functions for the noxious 
stimuli where the stimulus itself may 
be more directly the source of the rein- 
forcement uncomplicated by secondary 
factors (Campbell, 1958). 

Thus we see that sensory stimulation 
of the mouth receptors, especially of 
taste, has a direct relevance for the 
control of ingestive behavior and the 
instrumental responses which lead to 
ingestion. Although a number of in- 
vestigators have provided evidence that 
the mouth receptors may be bypassed 
and that learning can still take place, 
all such evidence shows that such learn- 
ing is not as effective as when stimula- 
tion of the mouth receptors is included 
(Coppock & Chambers, 1954) ; Miller, 
1957). Nearly all workers, whatever 
their theoretical predilections, have 
shown the importance of stimulation 
and “sensory contact,” some would say 
“sensory satisfaction” (Smith & Duffy, 
1957) in the reinforcement process. 
Indeed as Bindra (1959) concludes in 
his recent monograph on motivation: 
“Whatever the interpretation, it seems 
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clear that, up to a certain point, an in- 
crease in sensory stimulation is a posi- 
tive reinforcer” (p. 134). 

The fact that certain taste stimuli 
control ingestion directly appears to be 
biologically determined. Frings (1946) 
has pointed out that nearly all organ- 
isms accept sugar solutions. There are 
exceptions perhaps related to certain 
aspects of metabolic or other biochem- 
ical divergences among the species. 
Cats cannot taste or discriminate sugar 
solutions (Carpenter, 1956; Pfaffmann, 
1955), and birds (Pick & Kare, 1959) 
do not appear to show strong sucrose 
preferences ; but these examples are re- 
markable largely for their divergence 
from what otherwise appears to be a 
Further there is no con- 

ne ing evidence that the “sweet tooth,” 
where it does exist, depends upon the 
concomitant nourishment. The drink 
ing of nonnutritive saccharin solutions 
prolonged exposure to them 
shows no such as 
might be expected if the preference for 
Saccharin or “sweet” were acquired 
by past association with nourishment 
(Sheffield & Roby, 1950). Here we 
have “sweet for sweet’s sake.” 

On the other side of the coin, we 
have heen able to show that a “bitter” 
aversive stimulus can be made more ac- 
ceptable only temporarily when it is 
paired with the alleviation of thirst in 
early infancy (Warren & Pfaffmann, 
1958). Newborn guinea pigs were 
raised on a normally avoided solution 
of sucrose octaacetate (SOA) as the 
a three 


general rule. 


> 


under 


extinction 


sign of! 


only source of water for week 
period At 
showed the 
SOA. Gustatory stimuli, therefore, ap 
pear to be determined 
as the instigator of consummatory or 
avoidance responses and as primary 
positive or negative reinforcing stimuli. 

So far I have discussed the mech- 
anisms of a variety of behaviors which 


an older age these organ- 


isms usual rejection of 


biologically 
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Fic. 8. The preponderance of “pleasant” or 
“unpleasant” judgments in relation to the 
concentration of taste solution. Ordinate 
gives percentage “pleasant” minus percentage 
“unpleasant.” The abscissa is proportional 
to concentration, the full length of the base 
line standing for 40% cane sugar, 1.12% tar- 
taric acid, 10% NaCl, and 0.004% quinine 
sulphate (by weight). (Reproduced from 
Pfaffmann, 1958.) 


appear to be under the control of sen- 
sory stimulation qua “sensory stimula- 
tion.” I have not discussed the ques- 
tion of the affective or hedonic aspects 
of sensory stimulation, the pleasure of 
sensation, so to speak. To do so, I 
should now like to turn to other kinds 
of data derived from studies of man. 


Hedonic Aspects of Sensory 
Stimulation 


Of all the applied psychophysical 
fields, none have made greater use of 
the affective or hedonic rating scale 
methods, along with purely sensory 
testing procedures, than has the field 


of flavor technology. In a series of 
carefully controlled tests the Army 
Quartermaster Food Acceptance Lab- 
oratory (Pilgram, 1957), using a nine 
point hedonic rating scale, has been able 
to predict, with good reliability, the 
actual choices of food and the accept- 
ance of menus on the part of soldiers 
in the field. These ratings frankly ask 
such questions as the following : 
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like extremely—like very much—indif- 
ferent—dislike very much—etc. 


In this case the hedonic ratings are not 
determined solely by stimulus proper- 
ties of food, for other studies by the 
Quartermaster group have documented 
the important role played by familiarity 
and past experience. None the less, the 
frankly hedonic rating initiated by the 
sensory stimulation is a good predictor 
of actual acceptance and ingestion. 

I have been impressed by the ap- 
parent similarity of our rat preference 


curves with those of hedonic value 
obtained many years ago by Engel 
(1928). In those experiments, sub- 


jects were asked to rate different inten- 
sities of taste stimuli as either pleasant, 
unpleasant, or indifferent. The data 
can be treated in a number of ways, but 
in Figure 8 we see the ratings of four 
different taste modalities expressed as 


percentage of pleasant ratings minus the , 


percentage of unpleasant ratings for a 
group of seven observers using summed 
ratings of all observers. The abscissa 
is proportional to the concentrations ad- 
justed for each of the different stimulus 
solutions. Note that sugar begins at a 
slightly unpleasant value and rises with 
concentration to reach a plateau. Sour, 
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ence, rise to a peak, and then fall off 
to unpleasantness. Sweet is predomi- 
pleasant, bitter predominantly 
unpleasant, with salt and sour inter- 
mediate. In Figure 9 I have plotted 
the animal preference curve and the 
hedonic rating by Engel’s Ss for so- 
dium chloride solutions by concentra- 
tion. The hedonic ratings here were 
computed simply as the percentage of 
the total ratings that are pleasant. Ex- 
cept for position along the abscissa, the 
two curves show a striking similarity. 
Beebe-Center (1951) earlier called at- 
tention to these same relations and com- 
pared the animal preference responses 
with hedonic Actually the 
more recent analysis of the postinges- 
tive factors that control the intake of 
solutions has much to 
clarify the discrepancy between Engel’s 
hedonic curve for sugar and the rat’s 
preference for The sucrose 
hedonic curve does not turn down at 
the higher. concentrations (see Figure 
8). As noted earlier, rate of bar press- 
ing on a periodic schedule and fre? 
quency of choice in the brief exposure 
test are likewise monotonically related 
to concentration. Thus hedonic rating 
and reinforcement bear the same rela- 
tion to stimulus concentration. 

In the preference—aversion curves for 
other substances, the stop mechanism 
may be sensory (gustatory) in origin 
or might arise from other than post- 
ingestive effects. If the taste of strong 
salt solutions is aversive per se, then we 
might expect that, regardless of rein- 
forcement schedule, response will fall 
off above the optimal salt concentration 
in a manner resembling the “falloff” in 
Engel’s curves. The same might hold 
for saccharin which has a “bitter” sen- 
sory component at the higher concen- 
trations. Further study of these effects 
is needed. 

[I have been emphasizing stimulus 
properties as prime determinants of the 


nantly 


ratings. 
done 


sucrose 


sugar. 
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hedonic effect, but let me hastily point 
out that specific training and experi- 
ence may make some unpleasant odors 
and tastes acceptable or even preferred. 
I am reminded here of the whiskey 
drinker’s development so aptly de- 


scribed by Brown (1955). 


Straight whiskey, when first ingested, typ- 
ically effects rather violent defense reactions. 
Because of this, the novice drinker usually 
begins with sweet liqueurs, “pink ladies,” and 
wines, and slowly works his way through 
a series of beverages characterized by the 
gradual disappearance of cola and ginger ale 
additives. Finally, only plain water or even 
nothing need be mixed with the raw product. 
To the hardened drinker, straight whiskey 
does not taste bad—not bad at all! (It is 
thus that a product euphemistically labelled 
“neutral spirits” becomes indeed psycholog- 
ically neutral). 


Although this description appeared in 
defense of a drive reduction formula- 
tion of behavior and learning, the mere 
fact that drive reduction was supported 
and buttressed by the use of sweet and 
“pleasant” stimuli seems to have been 
overlooked. There is no convincing 
evidence that the primary relation be- 
tween hedonic tone and sensory stim- 
ulation is entirely acquired. Indeed, 
quite the converse seems to be true. 
The relation of sensory stimulation 
to hedonic process has been noted by 
earlier workers. Sherrington (1906) 
noted that the stimulation of contact 
receptors, as contrasted with distance 
receptors, is characterized by strong af- 
fective tone and that the contact re- 
ceptors stand in very close relation to 
consummatory responses. Stimulation 
of the touch receptors of the lips and 
touch and taste in the mouth initiates 
reflex movements that precede the act 
of swallowing. Troland (1928) spoke 
of three classes of stimulus. reception : 
nociception, beneception, and neutro- 
ception. The principal nociceptor sys- 
tem was pain, but other examples were 
hunger, thirst, the taste of bitter, strong 
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salts and acids, and certain foul or 
repugnant odors, etc. In the beneceptor 
class were the sense organs mediating 
erotic behavior, the taste of sugar, and 
certain other odors. Vision, hearing, 
touch, and proprioception were rele- 
gated to neutroception. Although Tro- 
land’s criterion was that of biological 
utility, Young (1936) pointed to the 
high correlation. between pleasantness 
and beneception, unpleasantness and 
nociception. He also notes that there 
is a frequent correlation between pleas- 
antness and approach, and unpleasant- 
ness and avoidance ; and further that the 
affective responses to simple colors and 
tones are much weaker than those 
evoked by odors, tastes, cutaneous or 
organic stimuli. 

In his work on palatability, Young 


_espouses a clearly hedonistic theory of 


reinforcement in which he treats the 
affective process as a postulate, an in- 
tervening variable (Young, 1959). The 
reactions of animals to taste solutions 
then can only be described when refer- 
ence is made to their positive or nega- 
tive hedonic effects. Thus, sugar solu- 
tions are hedonically positive, and the 
level of acceptability is directly propor- 
tional to the logarithm of the concentra- 
tion. Salt on the other hand does not 
show this one-to-one correspondence 
with sensory intensity. Low concen- 
trations of salt are said to be hedonically 
positive for the rat, but at high concen- 
trations are hedonically negative. Such 
terminology often appears simply to 
rename approach or avoidance behavior 
with hedonistic synonyms for we all 
know what we mean when we say in 
laboratory jargon that the rat likes 
sugar. We know he will take it in 
preference to water, he may press a bar 
for it, he may take it even when it is 
adulterated with something he ‘‘doesn’t 
like” such as quinine. But such usage 
does not necessarily indicate the prim- 
acy of the hedonic process; we must 
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Fic. 10. Lateral and medial surfaces of 
brains of rabbit (A), cat (B), and monkey 
(C) drawn roughly to scale. The drawings 
illustrate that the limbic lobe, represented in 
black, forms a common denominator in the 
brains of all mammals. (Reproduced from 
Journal of Neurosurgery.) 


have further experimental study of ac- 
ceptance and rejection and of their rela- 
tion to hedonic or affective processes. 
Thus it is abundantly clear that, in- 
stigation of consummatory response (or 
of rejection), reinforcement of instru- 
mental responses, and elicitation of he- 
donic effect are all closely related and 
that reproducible stimulus functions can 
be demonstrated for each. A definitive 


choice as to which of these is primary 


cannot yet be made. I would like to 
propose that sensory stimulation per se 
together with its ensuing central neural 
events be considered as a prime deter- 
minant in the chain of events culminat- 
ing in acceptance behavior, reinforce- 
ment, and hedonic effect. The further 
study of such stimulus functions and 
their analysis particularly at the phys- 
iological level is a problem that should 
merit the highest priority. 

I should like to turn to the possible 
physiological mechanisms in the rein- 
forcing and hedonic functions of stimuli. 
PHYSIOLOGY OF REINFORCEMENT 

AND SENsory AFFECT 


Tui 


The studies of cranial self-stimulation 
(Brady, 1958; Olds, 1958; Olds & 
Milner, 1954) show that the reward 
and punishment systems within the 
brain itself, but particularly the reward 
systems, have been found to depend in 


large measure upon the limbic system. 
In classical neuroanatomy this was 
known as the rhinencephalon or smell 
brain, but its relation to olfaction seems 
less and less particular. The limbic 
lobe including the hippocampus com- 
pletely surrounds the hilus of the 
hemisphere. Its various subcortical 
cell stations include the amygdala, 
septal nuclei, the hypothalamus, ante- 
rior thalamic nuclei, etc. The limbic 
lobe of the rabbit, cat, and monkey is 
shown in black in Figure 10. The 
limbic lobe appears to form a common 
denominator in the brains of all mam- 
mals. 


Its constancy of gross and microscopic struc- 
ture throughout the phylogeny of the mam- 
mal contrasts strikingly with the mushroom- 
ing neopallium which surrounds it. . . . Papez 
theorized that the experimental evidence also 
points to the limbic lobe as a cortical com- 
mon denominator for a variety of emotional 
and viscerosomatic reactions in the mammal 
(MacLean, 1954). 


There is increasing evidence of a sen- 
sory viscerosomatic influx into the 
limbic system. In pathological involve- 
ments, particularly of the temporal lobe, 
discharges in this part of the brain may 
be associated with a wide variety of 
auras involving afl the body senses as 
well as a great number of feeling and 
emotional states. Among the more 
purely sensory auras are crude olfac- 
tory sensations; alimentary symptoms 
including taste, thirst, hunger, nausea ; 
and somatic sensations ranging from 
pains to paresthesia. Prolonged rhy- 
thmic responses can be elicited in the 
pyriform area by olfactory, gustatory, 
and painful stimulation (MacLean, 
1954). MacLean notes that all three 
of these senses accent the quality and 
intensity of a stimulus rather than its 
spatial relationships. 

I do not wish to give the impression 
that this system is essentially one for 
viscerosomatic sensation, for it is also 
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implicated in a variety of complex ef- 
fects such as the Kliiver-Bucy syn- 
drome (1958), memory losses (Pen- 
field, 1958), the lowering of rage 
thresholds in some cases or increased 
docility after ablation in others (Bard 
& Mountcastle, 1947; Green, 1958; 
Schreiner & Kling, 1953,; Spiegel, Mil- 
ler, & Oppenheimer, 1940), and a va- 
riety of cranial self-stimulation effects 
(Brady, 1958; Olds, 1958; Olds & 
Milner, 1954). The details of these 
relations are beyond the scope of the 
present What I wish to 
emphasize is that there are significant 
sensory inputs into this system which 
may relate to the hedonic and reinforc- 
ing features of stimulation as compared 
with either the cognitive or arousal 
functions. 

Stellar and his colleagues (Sprague, 
Chambers, Stellar, Liu, & Robson, 
1958; Stellar, Chambers, Liu, Levitt, 
& Sprague 1958) at the University of 
Pennsylvania have made some interest- 
ing studies of chronic animals with ex- 
tensive lesions in the lateral lemniscal, 
primary sensory pathways. I was 
struck with their description of the 
lemniscal animals, i.e., the sensorially 
restricted animals displayed little affect 
in any situation, except, perhaps the 
most extreme. They flex and tend to 
pull away from a pinch, but do not 
attack or show any autonomic response. 
They showed little or no aversive reac 
tion to an ether cone, although they 
lacrimated and sneezed. Prior to the 
operation, they solicited petting and re- 
sponded to it well; afterwards they gave 
no reaction to petting. 

These observations suggest that the 
affective response to sensory stimuli 
might be mediated by the lemniscal and 
not the diffuse arousal system, which, 
it might be noted, already seems to have 
heen theoretically overworked by psy- 
I would like to remind you 
of the “thalamic syndrome,” described 


discourse. 


chologists. 


265 


by Head and Holmes (1911). Lesions 
in the region of the thalamus are char- 
acterized by “overreaction,” by exces- 
sive affectivity, for both pleasurable as 
well as painful sensations. This is 
pathology characterized by the affective 
modification of sensation. Although the 
“syndrome thalamique” is well known 
to clinical neurologists, its exact mech- 
anism is still not clear. For our pur- 
poses, it is sufficient to note that in this 
condition, the basic hedonic responses 
to sensory stimulation, some of the 
“Pleasures of Sensation” as well as 
the displeasures, can be unmasked. In 
short, I am suggesting that the affec- 
tive consequences of sensory stimula- 
tion are mediated by processes that 
depend upon the primary projection 
systems and their ramifications in 
thalamic and old brain neural connec- 
tions. Obviously much work is re- 
quired to place these speculations upon 
a more solid foundation. 


SUMMARY 


And now let me briefly summarize. 
I alluded briefly to the different roles 
of stimuli and sensory processes in the 
behavioral economy of the organism. 
I pointed out that traditionally sensory 
processes were studied largely in rela- 
tion to discrimination or so called cog- 
nitive functions. But stimuli have been 
shown to have other neural and behav- 
ioral functions as arousal and reinforce- 
ment. I ther went on to discuss in 
further detail certain of our own ex- 
periments on taste as a model system to 
show the relation of gustatory stimula- 
tion to the control of consummatory re- 
sponses and the reinforcement of in- 
strumental behavior. It was shown 
that there is increasing support for the 
idea that gustatory stimulation per se is 
capable of eliciting and reinforcing be- 
havior in its own right. We might say 
“sweet for sweet’s sake.” I then led 
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into the problem of hedonic or affective 
responses to such stimuli ; and, although 
a good correlation between affective 
response and reinforcement could be 
demonstrated, a statement as to their 
exact relation must await further study, 
particularly at the physiological level. 
Finally, I speculated as to the psycho- 
physiological mechanisms that might 
underlie reinforcement and the affective 
responses to sensory stimuli. My basic 
theme has been that sensory stimulation 
“qua stimulation” plays a significant 
role in the motivation as well as guid- 
ance of behavior—euphemistically we 
might say, in controlling behavior for 
the “Pleasures of Sensation.” 
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THEORETICAL NOTES 


THE CONCEPT OF STATISTICAL SIGNIFICANCE AND THE 
CONTROVERSY ABOUT ONE-TAILED TESTS 


H. J. EYSENCK 


University of London 


Several controversial papers regarding 
the uses and abuses of the one-tailed test 
of significance have recently appeared 
(Burke 1953, 1954; Goldfried 1959; Hick 
1953; Jones 1952, 1954; Marks 1951, 
1953). As Goldfried (1959) points out, 
“the important question debated is not if 
it should be used, but rather when it 
should be used.” It is suggested here 
that most of the disagreements emerging 
from this controversy stem from a misun- 
of the term “significance,” 
and it ts further suggested that the same 
misunderstanding runs through many dis- 
cussions of two-tailed tests as well. It 
will be suggested that in the sense in 
which Goldfried’s statement is meant, it 
has been the wrong question which has 
neither one-tailed nor two- 
should be used at all in the 
sense envisaged by most of the writers 
quoted. 

The outcome of the statistical examina- 
tion of experimental results is always 
stated in terms of the probability of dis- 
confirmation of the null hypothesis; the 
set of values which these’ p values can 
take is continuous in the interval from 
0 to 1. It is customary to take arbitrary 
p values, such as .05 and .01, and use 
them to dichotomize this continuum into 
a significant and an insignificant portion. 
This habit has no obvious advantage, if 
what is intended is merely a restatement 
of the probability values; these are al- 
and are far 


derstanding 


been debated ; 


tailed tests 


ready given in any case 
a simple dichotomous 
gross absurdities re- 
sult from taking these verbal statements 
too seriously; the difference between a 
C.R. of 1.90 and another of 2.00 is quite 
negligible, vet one falls on one side of 
the dichotomy, the other on the other 


side. This has led to such summary 


than 


more precise 


statement Indeed 


statements as: “almost significant,” or 
“significant at the 10% level.” If the 
verbal dichotomous not satis- 
factory—as it clearly is not—the answer 
surely is to keep to the continuous /p 
scale, rather than subdivide the verbal 
scale. 

However, surplus meaning has accrued 
to the.word “significant,” and it has be- 
come a shibboleth which divides the suc- 
from the unsuccessful research. 
frequently interpreted as almost 
reproducibility of results, 
while failure to reach significance is in- 
terpreted as disconfirmation. Hence the 
urgent desire to achieve respectability and 
significance by one-tailed tests, if need 
be, and the argument regarding when the 
cachet of “significance” can be bestowed 
upon a research result. Yet the argu- 
ment, and the achievement or nonachieve- 
ment of significance, do not alter the 
facts of the case, which are contained in 
the statement of the p value of the re- 
sults. Anything beyond these facts de- 
pends upon interpretation, and is sub- 
jective; it does not alter the facts of the 
case in the slightest. 

As an example of the necessity of inter- 
pretation, consider the a priori proba- 
bility of the conclusion. Suppose that an 
experiment on ESP were carried out with 
all the precautions which human ingenu- 
ity can devise, so that even the most 
sceptical had to agree that no fault could 
be found with the experimental design. 
Suppose also that a p value of .05 were 
Would this be considered “sig 
sense of guaranteeing 
Critics would point out 
quite rightly that where the a priori 
probability is very low, as in this case, 
much higher p values would be required 
to carry Logicians are 


scale is 


cessful 
It is 


guaranteeing 


achieved. 
nificant,” in the 


reproducibility ? 


significance. 
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agreed that interpretation of experimental 
results must call on all available knowl- 
edge about the subject in question; a 
priori probability is a kind of summary 
statement of much of this knowledge. It 
cannot be overlooked in arriving at a 
conclusion regarding “significance” when 
the term carries the surplus meaning in- 
dicated. 

That interpretation into the 
problem very much is clear when we look 
at such conditions as those suggested 
by Kimmel (1957) as criteria for the 
use of one-tailed tests. He suggests, for 
instance, that they may be used if re- 
sults in the opposite direction would be 
psychologically meaningless or could not 
be deduced from any psychological theory. 
These are obviously not objective cri- 
teria, but depend on what the author (or 
reader) considers psychologically mean- 
ingless, or the kind of theory he may 
hold. Opinions will differ, and conse- 
quently some readers will agree to the 
use of the one-tailed test in a particular 
case, others will not. Thus to some read- 
ers the results will appear significant, to 
others insignificant. 

The whole argument seems to be about 
words, not about facts: Is the word “sig- 
nificant” to be used in a given situation, 
or is it not? This would only matter if 
the word carried some objective mean- 
ing not contained in the probability fig- 
ures; we have argued that it does carry 
surplus meaning, but that this is not of 
an objective kind. Consequently, nothing 
important is changed by omitting the 


come } 


term altogether in the report, leaving in- 


terpretation to the reader. After all, the 
only true proof of reproducibility is re- 
production! Verbal assertions of “sig- 
nificance” have no more meaning than 
the droit du pour at the court of Louis 
XIV. 


The solution is to separate quite clearly 


‘ 


and decisively the objective statement of 
the probability of disproof of the null hy 
pothesis (by means of a two-tailed test), 
and the subjective evaluation and inter 
pretation of the results. The 
would be able to accept the first state- 
ment as a statement of fact and would 
then be able to judge for himself the ar- 


reader 


guments presented by the author regard- 
ing the meaning of these facts. These ar- 
guments might be based on results of 
previous experiments, predictions made 
on the basis of more or less widely ac- 
cepted theories, number of cases involved, 
a priori lack of accepability of the con- 
clusions, and other similar grounds: an 
explicit statement of the arguments would 
enable the reader to decide for himself 
the acceptability of the conclusions in a 
manner precluded by the simple state- 
ment of one-tailed probability. A state- 
ment of one-tail probability is not a state 
ment of fact, but of opinion, and should 
not be offered instead of, but only in 
addition to, the factual two-tailed proba- 
bility; if it is offered at all, it should be 
accompanied by a full statement of the 
arguments in favor of its facilitating a 
more meaningful interpretation of the 
data. In the writer’s opinion, it would 
be better to drop such statements of one 
tailed probability altogether and rely en 
tirely on appropriate argumentation to 
establish the meaning of the observed 
(two-tailed ) probabilities. 

Implicit in this recommendation 
corollary that the mechanical evaluation 
of experimental results in terms of “sig 
nificant” and “not significant” be dropped 
outright. Interpretation is implicit in the 
statement of one-tailed probabilities, but 
it is also implicit in the statement of two 
tailed auto 
matically interpreted as being significant 
or not significant 
meaning these 
experimenter should give his (two-tailed ) 
p values and then proceed to argue re- 
garding the acceptability of the conclu 
sions on the 
There 


solemn 


is the 


probabilit‘es if these are 


with all the surplus 


carried by terms. The 


indicated. 
literature 


basis already 
appeared in the 
about the 


causes for discrepancies between two ex 


have 
discussions possible 
periments, one of which gave significant 


the other insignificant results: vet the 
respective ¢ values were almost identical 
one side, the other 
of the 


Such arguments are unrealistic 
and would be avoided if p values 


one lving just on the 


just on the other side, arbitrary 


5% line. 
were 
compared, rather than verbal statements 
Two experiments giving p values of .048 
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and .056 are in excellent agreement, al- 
though one is significant, while the other 
is not. 

To summarize the main point of this 
note briefly, we would say that verbal 
statements regarding “significance” are 
at best supererogatory restatements in an 
inconvenient dichotomous form of results 
already properly stated in terms of a 
continuous system of p values; at worst 
they carry unjustified surplus meaning 
of an entirely subjective kind under the 
guise of an objective and mathematically 
meaningful statement. Subjective judg- 
ments of reproducibility cannot reason- 
ably be based on the mechanical applica- 
tion of a rule of thumb whose only use- 
fulness lies in the elementary instruction 
of undergraduates lacking in mathemati- 


cal background; if they are to be made at 


all they demand complex consideration of 


probabilities. It is suggested 
factual statement 
of probabilities should be 
mandatory and that all subjective con- 
arguments, and judgments 
clearly separated from such 


a priori 
that the accurate and 


(two-tailed ) 


siderations, 
should be 
implied that 


factual statements. It is 


judgments of “significance” belong with 
the subjective side, and it is also implied 
that the calculation of p values on the 
basis of one-tailed tesis has no place in 
psychology. 
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\ NOTE ON THEORY AND METHODOLOGY IN THE 


STUDY OF FIGURAL 


AFTEREFFECTS 
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Recently, Mathews and Wertheimer 
lave derived a theorem from the Kohler- 
Wallach (1944) theory of figural after- 
effects in order to provide an experimen- 
tal test to discriminate between that 
theory and the Osgood-Heyer (1952) 
theory. This test must fail, however, be- 
cause the principle that is invoked is 
contained in both theories, and the same 
prediction is at least as readily derived 
from either. Instead, the contribution of 
Mathews and Wertheimer lies in drawing 
attention to some commonly overlooked, 
but logically necessary consequences of a 
satiation-type theory. The rule that dis- 
placements will be uniformly symmetrical 
with respect to the contours of any I- 
figure has come to be accepted as a gen- 
statement of Kohler’s theory and 
empirical findings. For example, Sagara 
and Oyama (1957) reported several ex- 
which asymmetrical dis- 
placements were found as a contradiction 
of Kohler and Wallach (1944). The 
interpretation that Kohler and Wallach 
gave for this same finding in their orig- 
inal paper is rejected by Sagara and 
hoc suspension of 
In part, the object 
of the following analysis is to show that 
asymmetrical displacements are a logically 
consequence of Kohler’s as- 
sumptions in a wide variety of situations, 
while the rule of symmetry only applies to 
a restricted class of figural aftereffects. 

The satiation of Kohler and Wallach 
and the neural adaptation of Osgood and 
Heyer are processes which arise from the 
contours of a figure when it is seen 
against a contrasting field. Satiation 
spreads to regions about the contours of 
the figure in a manner determined by elec- 
trotonic field-forces assumed to exist in 
the visual projection area of the cortex. 
Veural adaptation spreads about the con- 
tours in a manner determined by the 
statistical spread of neural excitation in 


eral 


periments in 


Oyama as a special ad 


the rule of symmetry. 


necessary 


Massachusetts 


the visual cortex caused by “the simul- 
taneous action of physiological nystagmus 
and reciprocal overlap of dendritic proc- 
esses” (Osgood & Heyer, 1952, p. 102). 
As Osgood and Heyer freely admit, once 
each theory has accomplished its genu- 
flections to the particular sort of ritual- 
istic physiological speculation favored by 
its authors, the results are essentially 
equivalent. With time, a gradient of 
satiation (or neural adaptation) is formed 
about the contour. The gradient is high 
est directly over the contour and decreases 
with distance in either direction, tending 
to be sigmoid in shape due to an assumed 
flattening at the peak. In general, dis 
placements will be from regions where 
the gradient is high to regions where it is 
low. The derivation up to this point is 
illustrated in Fig. 1, where CD represents 
a line presented as an I-figure, P and P’ 
represent the loci of T-figures presented 
after CD has been removed, and the ar- 
rows indicate the direction of displace- 
ments. If either P or FP’ is brought too 
close to CD, the gradients will extend 
symmetrically in both directions so that 
no displacement can occur. The extent of 
displacement has been found to increase 
with distance from the contour up to a 
maximum and then decrease again (the 
distance paradox). To account for this, 
both theories assume that the extent of 


displacement is a complex function of the 
height and the steepness of the gradients. 
Such symmetry of direction and extent, 
with CD representing the axis of sym- 
metry, can be expected only as long as the 
I- and T-figures are analogous to the 
diagram of Fig. 1 which will be referred 


to here as Paradigm I. The analogy 
will continue to hold when the I-line, CD, 
is widened to become an oblong or square, 
but only so long as the T-figure does not 
become appreciably smaller than the I- 
figure. 

Paradigm I is the 


illustration most 
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D 


Fic. Paradigm I. 


mentioned in the theoretical 
of both Kohler and Wallach 


frequently 


discussions 


and Osgood and Heyer. Also, analogies 


of Paradigm I comprise the more in- 
tuitively obvious and striking of the vari- 
ous demonstrations reported by Kohler 
and Wallach, as well as those most re- 
liably confirmed by subsequent investiga- 
tors.' Thus, it is understandable that 
this simple paradigm has come to be ac- 
cepted as the general model of figural 
aftereffects according to satiation theory. 
As Kohler would say, “Nothing could be 
from the truth.” Consider the 
Fig. 2. Here the square 
ABCD represents a solid I-figure pre- 
sented against a contrasting field (e.¢., 
The point 
P represents the locus of a T-figure ap 
than ABCD 
contours of the I- 
and near the CD, while 
© represents the locus of a similar T- 
figure presented near CD but within the 
The solid ar- 


farther 


diagram in 


bl ack square on white field ). 


. 1 | a 
preciably smatier 


that is 
presented outside the 


contour 


horure 
gure 


contours of the I-figure. 


1 Extensive reviews of this literature have 
been published recently by McEwen (1958), 
Sagara and Oyama (1957), and Spitz 
(1958). 
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rows represent the symmetrical displace- 
ments on either side of CD that would be 
predicted merely by superimposing the 
I-line of Paradigm I on the new diagram. 
Clearly, this would not be justified. Un- 
less the I-figure is so wide or the I-period 
is so short that the tissue satiating cur- 
rents flowing from the opposite contour are 
negligible, displacement at Q should not 
be symmetrical with displacement at P. 
Without doing violence to the rest of 
the theory it is necessary to suppose that 
the satiating currents arising from the 
opposite contours, AB and CD, combine 
with each other in many cases. As a 
result, the satiation at loci inside the con- 
tours will always be greater than that at 
an equal distance outside the contours. 
At first, the cross sectional pattern of 
satiation will be saddle-shaped with the 
level of satiation in the cleft of the saddle 
rising more rapidly than that of the out- 
With time a 
relatively flat dome-shaped pattern will 

hich spans the distance between 
Eventually, a homogeneous 
will be obtained be- 
extending for 
some distance beyond them (Kohler & 
Wallach, 1944, p. 343-344). A precise 
statement of the rate at which each stage 
will appear and then to the 
next is bevond the scope of the theory in 
its present form. Nevertheless, with a 
suitable choice of I-figure, T-figure, and 
I-period, it should be possible to find a 
locus Q between the area of greatest sati- 
ation and the contour CD such that the 
direction of displacement will be towards 


ward sloping gradients. 
emerge w 
the contours. 
ition 


contours 


degree of satiz 


tween the and 


vive way 
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I- and T-figures used by Mathews 
and Wertheimer (1958). 


CD. This is broken 
arrow in Fig. 2. 

Despite the claim of Mathews and 
Wertheimer (1958), which they have 
documented with a personal communica- 
tion from Osgood, precisely the same 
theorem can be derived from the Osgood- 
Heyer theory. Osgood and Heyer (1952, 
p. 108) explicitly state that the statistical 
distributions of neural adaptation arising 
from opposite contours must overlap and 
summate with each other in a manner 
analogous to the satiation of Kohler and 
Wallach. Their Fig. 2 (1952, p. 105) 
is a sectional representation of 
Paradigm I. It becomes Paradigm II 
merely by superimposing upon it the 
overlapping gradient of neural adaptation 
which might be expected from a second 
contour. If this is done, the predictions 
of the Mathews-Wertheimer theorem 
must follow. Of obtain a 
prediction for a specific I-figure it will 
no longer be possible to avoid specifying 
the parameters of the statistical distribu- 
tions as it was in the original analysis 
presented by Osgood and Heyer. 

Mathews and Wertheimer (1958) per- 
formed an experiment which they de- 
signed as a test of their theorem. The 
I- and T-figures they used are drawn to 
from their description in Fig. 3. 
The I-figure solid, black, 2-in. 
square corresponding to ABCD in Fig. 2, 
while TL, TC, and TR were three %4-in. 
squares presented simultaneously as a T- 
figure after a 2-min. inspection of the I- 
figure. The locus of TL corresponds to 
locus Q in Paradigm IT. Each of 16 
observers (Os) asked to report 
which distance was greater, that between 


indicated by the 


cross 


course, to 


scale 
was a 


was 


TL and TC or that between TC and TR, 
and 14 chose the right hand distance as 
the greater. According to Mathews and 
Wertheimer: “This result favors the 
Kohler-Wallach prediction, by bimodal 
expansion, at the 1% level.” Since no 
further data are submitted, we must sup- 
pose that this probability level was ar- 
rived at by assuming that 16 other Os 
who had not inspected the I-figure—that 
is, a control group—would have been 
equally divided in their choices. Un- 
fortunately, those of us who do not con- 
cur in this admittedly common practice 
of assuming the performance of a control 
group must remain in doubt as te the out- 
come of the experiment. Further elabo- 
ration of this point is unnecessary as the 
importance of such a control in the study 
of figural aftereffects has already been 
length by Brown 
(1954). 
with a 


discussed at 
(1953) and Wertheimer 
In any event, even 


some 


control 


group, the particular T-figure chosen by 


Mathews and Wertheimer could never be 
used to study the displacement of the 
All objects in the T-figure 
must be assumed to exhibit 
placement. Consequently, the best pos 
sible measurement of the extent or direc 
tion of displacement is the relative 
displacement of a near T-object 
compared with that of a second, remote 
T-object. When the relative displace 
ments of both the near and the remote 1 
objects are referred to that of an inter 
mediate, 


square TL. 


some dis 


when 


T-object no meaningful 
result can be obtained. The Os could 
report that the distance TC—TR is 
greater than the distance TL—TC when 
both TL and TC are displaced away from 
the contour in any case where the dis 
placement of TR is greater than the com- 
bined displacement of TL and TC. In- 
this might commonly 
due to the distance paradox. Similarly 
the distance TL—TC could be reported 
as greater even when TL is displaced 
towards the contour in any case where 
the displacement of TC away from the 
contour is greater than the combined 
displacement of TL and TR. The ubiq- 
uitous distance paradox would readily 
account for this outcome also, 


third 


deed, occur very 
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Fortunately, Fox (1951) has already 
reported two technically more adequate 
experiments that fulfill the topological 
requirements of the Mathews-Wertheimer 
theorem. In Experiment I, Fox found 
that T-figures presented on the contour 
or just inside of it were displaced out- 
wards, but to a smaller extent than T- 
figures presented outside the contour. 
Using somewhat different I-figures, in 
Experiment II he found that displace- 
ments were symmetrically away from the 
nearest contour fora variety of loci on 
either side of the contour. Fox concluded 
in favor of Kohler and Wallach on the 
basis of Experiment II, attributing the 
asymmetrical results of Experiment I to 
an artifact not present in Experiment II. 
It might at first appear that in order 
to preserve the Mathews-Wertheimer 
theorem, Fox should have based his con- 
clusion on the 
and explained away the symnsetrical re- 
sults of Experiment II. But this, too, 
would oversimplify the situation and un- 
derestimate the full power of the satiation 
theory. Clearly, both the initial pattern 
of satiation and its variation as a func- 
tion of time must depend upon the par- 
ticular I-figure that is in question. In 
the complex situation described by Para- 
digm II Fox’s results can be accounted 
for simply by noting that the I-period he 
used must have been long enough to pro- 
duce asymmetrical displacements with 
the I- and T-figures of Experiment I, 
but not with those of Experiment IT. 

The Mathews-Wertheimer theorem rep- 
resents Case 1 of Paradigm II. It is 
also possible to derive its converse, Case 
2, where the direction of displacement at 
loci outside the contours reverses with 
time under certain conditions. That this 
case, too, is implied as a consequence of 
satiation theory was noted by Kohler and 
Wallach (1944): 


results of Experiment I 


Suppose that the T-squares lie half inside 
and -half outside the area of the I-oblong [on 
opposite contours]. If the inspection period 
is not too short the distance between these 
squares will decrease. Even with T-squares 
which lie entirely outside the oblong but 
just adjacent to it the same will be observed 

always provided that the inspection period 
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has been sufficiently long ... the direction 
of the effect depends upon the degree in 
which the medium has been altered by pre- 
vious inspection. Also, during late stages of 
the inspection period a figure process may 
not affect the medium in quite the same way 
as it does in the beginning. Incidentally, in 
the same arrangement slight variations of 
the I-object may turn the effect in one direc- 
tion or the other (p. 297). 


The existence of Case 2 displacements 
towards the contour that Kohler and Wal- 
lach reported have since been confirmed 
by Smith (1954) and by several Japanese 
studies reviewed by Sagara and Oyama 
(1957). The explanation seems to de- 
pend upon the existence of I-figures which 
give rise to gradients which are steeper 
inside the contour than outside the con- 
tour. Both Kohler and Wallach (1944, 
pp. 339-340) and Osgood and Heyer 
(1952, p. 108) invoke this principle to 
account for an otherwise anomalous dis- 
crepancy between outline and solid fig- 
ures. In such a case the peak will be 
shifted outwards somewhat, and further 
summation and interaction with the gra- 
dient extending from the opposite con- 
tour should increase this shift. The 
effect will not always be observed, of 
course; but with a suitable choice of I- 
figure, T-figure, and I-period, it should 
be possible to find a locus / outside the 
contour such that displacement will be 
towards the contour. In connection with 
this, Sagara and Oyama (1957) report 
that where Case 2 reversals were found 
the extent of nonreversal displacement 
was greater inside the contours than out- 
side. This is precisely the result that 
would be expected if the inside gradient 
is steeper than the outside gradient in 
Case 2. 

There is essentially no contradiction 
between the predictions of Case 1 and 
those of Case 2. Each is appropriate to 
a separate class of I-figures. It is in- 
that different classes 


tuitively obvious 
of I-figures must give rise to different 
patterns of electrical activity in the visual 
To claim otherwise is to do vio- 
However, the various 
patterns are related by a common neuro- 
According to Kohler 


cortex. 
lence to language. 


logical mechanism. 
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and Wallach, this mechanism is the elec- 
trotonic field-forces of satiation. Accord- 
ing to Osgood and Heyer, it is the statis- 
tical distribution of neural adaptation. 
Neurological evidence has been cited on 
both sides. To be sure, the neurological 
data available to the authors were not 
sufficient to permit either theory to spe- 
cify the particular pattern of cortical 
activity that would be produced by any 
particular set of conditions. This is un- 
derstandable when we consider that the 
evidence that is cited pertains to phe- 
nomena that take place within fractions 
of a second, while figural aftereffects can 
extend several minutes at least. 
But, with disarming frankness, both Koh- 
ler and Wallach and Osgood and Heyer 
admit the need for much further research 
in this area. In the interim, the gap is 
bridged by boldly assuming that in cases 
of Paradigm I the long-term neurological 
processes will be analogous to the short- 
term while in cases of Para- 
digm II summation and interaction over 
time will produce precisely the gradients 
that are account for the 
figural aftereffects Thus, 
whether the ogists of the 
future opt for field-forces, or statistically 
independent units, or—as in modern 
hysics—some happy combination of the 

it is probably safe to say now that 
perceptual phenomena are related to neu- 


ral activity. 


over 


processes, 


necessary to 
that occur. 
1 


neurophysio 
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SCALE CLASSIFICATION AND STATISTICS 


JOHN GAITO 


Wilkes College 


The classification of psychological scales 
into nominal, ordinal, interval, and ratio 
categories by Stevens (1951) has been a 
significant contribution to psychophysical 
and measurement theory. This classifica- 
tion has been widely accepted and applied 
by psychologists in conversation, teach- 
ing, and in publication (e.g., Edwards, 
1958; Guilford, 1954; Senders, 1958; 
Siegel, 1956). In considering these scales 
Stevens has specified the appropriate sta- 
tistical measures for use with each. This 
latter aspect has had a tremendous im- 
pact on writers of statistical texts. For 
example, these categories form the basis 
for the texts by Senders (1958) and by 
Siegel (1956). There is no doubt that 
this classification has great practical and 
theoretical value. However, we feel that 
the application of this classification has 
led to some misunderstanding in regard 
to the use of various statistical techniques 
and to an overemphasis on the utility of 
nonparametric techniques in psycholog- 
ical research, 

As an example of this tendency Siegel 
(1956, p. 26) maintains that nonpara- 
metric tests of significance should be used 
He lists the 
uirement of interval scale as one of the 


with subinterval type data. 


sumptions for the use of the analysis of 
variance. 
not be 
ematical bases of assumptions ( Eisenhart, 


However, this assumption can- 


1 


found if one looks to the math- 


1947). In fact a noted statistician, Kemp- 
thorne (1955), after showing mathemat- 
ically that the normal theory analysis of 


variance test can approximate the ran 
domization test, states that 


This serves as some theoretical basis for 
the fact which has been noticed by most 
statisticians, that the level of significance of 
the analysis of variance test for differences 
between treatments is little affected by the 
choice of a scale of measurement for analysis 
[italics added]. 


The important consideration for the use 
of the analysis of variance is not that the 
data have certain scale properties but that 
the data can be related to the normal dis- 
tribution, plus approximating the other as- 
sumptions of independence and homogen- 
eity of errors. Furthermore. it must be 
emphasized that the normal distribution 
refers to the distribution of those portions 
of the data which are used as the appro- 
priate estimates of error and not neces- 
sarily to the distribution of the observa- 
tions. This is an important consideration 
because in multivariable designs the var- 
iation within each group may not dis- 
tribute normally but the portions of cer- 
tain interaction terms may. In many of 
these designs some of the interaction 
effects provide the valid estimate of error 
in testing certain effects. 

It would seem that the comments con- 
cerning the relation between the various 
scales and the appropriate statistical pro- 
cedures should serve merely as a guide. 
In some cases empirical data may indicate 
that rigid adherence to such statistical 
methods is not required and wasteful of 
the data. The author has propounded 
elsewhere this viewpoint and cited rel- 
evant data concerning the analysis of 
variance technique (Gaito, 1959). 

Furthermore, the same data may be 
considered to have the properties of two 
or more scales, depending on the context 
in which it is considered. For example, 
if we look at the response of one subject 
(S) to a single item, the properties of the 
data are those of a nominal scale, i.e., 
right or wrong. However, if we concen- 
trate on the total score for one S or the 
total scores for a group of Ss, we have at 
least an ordinal scale. This is similar to 
the situation of approximating the bi- 
nomial distribution (a distribution hav- 
ing nominal, or possibly ordinal, scale 
properties) by the normal distribution (a 
distribution having at least ordinal, pos- 
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interval, scale properties) when n 
increases and p is close to .5. Another 
example is the relation between the sign 
test and the binomial distribution test. 
For a given set of data these two methods 
will give the same result; yet Siegel lists 
the former as of an ordinal nature, the 
latter as nominal data. 

It appears that in dealing with cescrip- 
tive statistics the comments made about 
the use of nonparametric statistics for 
subinterval-type data are most appro- 
priate. However, in dealing with tests of 
significance such statements are not as 
appropriate, even though descriptive sta- 
tistics are involved in the procedure. For 
each test the mathematical requirements 
are expressed in the assumptions, and 
nothing concerning specific scale prop- 
required. Supplementing the 
requirements will be evidence to indicate 
the effects on the data of failure to meet 
the assumptions (Gaito, 1959; Lindquist, 
1953). Thus if empirical research indi- 
cates little effect on the data when as- 
sumptions are not met or merely approx- 
imated, the investigator may use a para- 
metric technique and allow for the effects 


sibly 


erties is 


of the deviations from assumptions in his 
interpretation. 
It is encouraging to note that some 


have been reluctant to em- 
brace wholeheartedly the nonparametric 
technique (e.g., Guilford, 1954; Mc- 
Nemar, 1957, 1958; Savage, 1957). In 
a review of the statistical literature in the 
Annual Review of Psychology, Grant 
1959, p. 137) appropriately summarizes 
this position by the comment: “Some 
much-needed negative thinking has re- 
cently appeared on nonparametric tech- 
niques” | italics added]. 


individuals 


In conclusion we wish to restate that 
the statements concerning scale proper 
ties and statistical procedures should be 
guides and that context, mathematical as- 
sumptions of statistical procedures, and 
the results of research concerned with 
failure to satisfy assumptions should be 
the ultimate determiners of the choice of 
statistical techniques. 
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